
Using the jieba library: 聊斋志异

# -*- coding: utf-8 -*-
"""
Created on Wed Dec 20 00:02:07 2023
@author: chen
"""
import jieba
# Read the text file
path = "聊斋志异.txt"
file = open(path, "r", encod ......

The jieba library and 聊斋志异

import jieba
txt = open("D:\python课本内容\聊斋志异.txt","r",encoding = 'utf-8').read()
words = jieba.lcut(txt)
counts = {}
for word in words:
    if len(word) == 1: ......

Segmenting 西游记 with the jieba library (No. 13)

import jieba

def merge_synonyms(words, synonyms):
    merged_words = []
    for word in words:
        for synonym_list in synonyms:
            if word in synonym_list:
                merged_w ......
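
The excerpt above is cut off; as a rough sketch of what such a synonym-merging helper might do for 西游记 character names (the synonym groups below are illustrative guesses, not taken from the original post):

```python
import jieba

def merge_synonyms(words, synonyms):
    """Replace every alias in a synonym group with the group's first (canonical) name."""
    merged_words = []
    for word in words:
        for synonym_list in synonyms:
            if word in synonym_list:
                word = synonym_list[0]
                break
        merged_words.append(word)
    return merged_words

# Hypothetical synonym groups, for illustration only.
synonyms = [
    ["孙悟空", "行者", "悟空", "大圣", "孙行者"],
    ["唐僧", "三藏", "师父", "唐三藏"],
    ["猪八戒", "八戒", "呆子"],
]

words = jieba.lcut("行者与八戒护着师父西行")
print(merge_synonyms(words, synonyms))
```

Merging aliases before counting keeps one character's mentions from being split across several names in the frequency table.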

jieba segmentation: the 20 most frequent words in 西游记

import jieba

txt = open("D:\Python project\Python123作业\西游记.txt", "r", encoding='utf-8').read()
words = jieba.lcut(txt)  # segment the text in precise mode
count ......

jieba word segmentation

The 20 most frequent words in 西游记. Input:
import jieba
excludes = {"一个", "我们", "怎么", "那里", "不知", "不是", "只见", "两个", "不敢", "这个", "如何", "原来", "甚么", "不曾", "闻言", "正 ......
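
Every excerpt in this listing is truncated, so for reference, here is a minimal end-to-end sketch of the counting pattern these posts follow (the file name and the stop-word set are placeholders, not taken from any particular post):

```python
import jieba

TEXT_PATH = "西游记.txt"                                      # placeholder input file
excludes = {"一个", "我们", "怎么", "那里", "不知", "不是"}     # illustrative stop words

with open(TEXT_PATH, "r", encoding="utf-8") as f:
    txt = f.read()

words = jieba.lcut(txt)                 # precise-mode segmentation
counts = {}
for word in words:
    if len(word) == 1:                  # skip single characters (mostly particles)
        continue
    counts[word] = counts.get(word, 0) + 1

for word in excludes:                   # drop generic words that crowd the top
    counts.pop(word, None)

items = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
for word, count in items[:20]:          # 20 most frequent words
    print(f"{word:<10}{count:>5}")
```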

jieba segmentation of 红楼梦

Students whose ID ends in 4, 5, or 6: segment 红楼梦 and list the 20 most frequent words. Code and results below:
import jieba
txt = open("D:\红楼梦.txt","r",encoding = 'ansi').read()
words = jieba.lcut(txt)  # precise-mode segmentation
count ......

jieba word segmentation

import jieba
# Read the text
path = "聊斋志异.txt"
file = open(path, "r", encoding="utf-8")
text = file.read()
file.close()
# Segment with jieba
words = jieba.lcut(text)
# Count word frequencies
count ......

jieba word segmentation

Students whose ID ends in 1, 2, or 3: segment 西游记 and list the 20 most frequent words.
import jieba
# Read the text file
path = "西游记.txt"
file = open(path, "r", encoding="utf-8")
text = file.read()
file.close()
# ......

jieba word segmentation

Description: Students whose ID ends in 1, 2, or 3: segment 西游记 and list the 20 most frequent words. ......

jieba segmentation: the 20 most frequent words in 红楼梦

import jieba
import wordcloud

def takeSecond(elem):
    return elem[1]

def createWordCloud(text):  # generate the word cloud
    w = wordcloud.WordCloud(font_path="STZHONGS ......

jieba word segmentation

import jieba
import collections

with open('journey_to_the_west.txt', 'r', encoding='utf-8') as f:
    text = f.read()
words = jieba.cut(text)
word_counts ......
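
This post appears to lean on collections.Counter rather than a hand-rolled dict; a minimal sketch of that variant, keeping the excerpt's file name as an assumption:

```python
import jieba
from collections import Counter

with open('journey_to_the_west.txt', 'r', encoding='utf-8') as f:  # assumed file name
    text = f.read()

# jieba.cut returns a generator; filter out single characters and let Counter consume it.
words = (w for w in jieba.cut(text) if len(w) > 1)
word_counts = Counter(words)

for word, count in word_counts.most_common(20):
    print(word, count)
```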

jieba word segmentation

import jieba
path = "all.txt"  # read the text file
file = open(path, "r", encoding="utf-8")
text = file.read()
file.close()
words = jieba.lcut(text)  # segment with jieba
counts ......

jieba word segmentation

jieba segmentation, 聊斋 (No. 29). Code:
import jieba
txt = open("D:\python-learning\liaozhai.txt", "r", encoding='utf-8').read()
words = jieba.lcut(txt)
counts = {}
for word in ......

jieba word segmentation

......

Assignment 5: jieba segmentation, the 20 most frequent words in 西游记.

import jieba
'''
f = open("D:/学习/个人/pythonwork/西游记.txt", 'r', encoding='utf-8')
txt = f.read()
if txt.startswith(u'\ufeff'):
    content = txt. ......
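
The commented-out block in this excerpt checks for a UTF-8 BOM by hand; a simpler equivalent (just a sketch, with an assumed file path) is to open the file with the utf-8-sig codec, which strips the BOM when present:

```python
# Assumed path for illustration; utf-8-sig removes a leading '\ufeff' automatically.
with open("西游记.txt", "r", encoding="utf-8-sig") as f:
    txt = f.read()
```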

jieba segmentation of 聊斋

import jieba
excludes = {"不知","不可","一日","不敢","数日","以为","不能","可以","不得","如此"," ","三日","而已","明日","其中","未几","二人","一人"}
txt = o ......

jieba word segmentation

jieba word segmentation. Description ......

jieba segmentation of 西游记

import jieba
txt = open("西游记.txt", "r", encoding='utf-8').read()
words = jieba.lcut(txt)
counts = {}
for word in words:
    if len(word) == 1:
        continue
    eli ......

jieba segmentation: 西游记

import jieba
excludes = {"一个","那里","怎么","我们","不知","两个","甚么","不是","只见","原来","如何","这个","不曾","不敢","闻言","正是","只是","那怪","出来 ......

jieba-聊斋

import jieba
txt = open("聊斋.txt", "r", encoding='utf-8').read()
words = jieba.lcut(txt)
counts = {}
for word in words:
    if len(word) == 1:
        continue
    else:
        counts[word ......

jieba word segmentation

import jieba
with open('红楼梦.txt', 'r', encoding='utf-8') as f:  # open the file
    txt = f.read()  # read the contents into txt
words = jieba.lcut(txt)  # segment with jieba's lcut
counts = {}  # create the counting dict ......

Python blog assignment 5 (jieba segmentation)

Students whose ID ends in 7, 8, 9, or 0: segment 聊斋 and list the 20 most frequent words.
import jieba
# Read the text file
path = "聊斋志异.txt"
file = open(path, "r", encoding="utf-8")
text = file.read()
file.close() ......

Final major assignment: jieba word segmentation

import jieba
txt = open("聊斋志异.txt", "r", encoding="utf-8").read()
excludes = {"不知", "不可", "一日", "不敢", "数日", "家人", "以为"}
words = jieba.lcut(txt)
counts ......

jieba-cant-extract-single-character

jieba can't extract single character. Subtitle: jieba cannot extract a single character. Created: 2023-11-13T15:28+08:00 Published: 2023-11-13T15:45+08:00. Take the sentence "我喜欢赵" as an example, with 「赵」 standing in for someone's name, ......
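
A minimal sketch of the behaviour this title points at, assuming the complaint is about jieba.analyse.extract_tags (which by default filters out words shorter than two characters) rather than about the tokenizer itself:

```python
import jieba
import jieba.analyse

sentence = "我喜欢赵"  # 「赵」 stands in for a person's name, as in the post

print(jieba.lcut(sentence))                          # the tokenizer does emit single characters
print(jieba.analyse.extract_tags(sentence, topK=5))  # keyword extraction drops the one-character 「赵」
```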

Manually installing the jieba package with pip | [babyllama2-chinese]

While reproducing the babyllama2-chinese project, installing jieba failed with errors under both pip and conda, so I tried installing it by hand. The steps: first see another blog post, plus my own post on installing packages locally with conda/pip; then I went straight to the PyPI site, https://pypi.org, and searched for the jieba package. Note that ......

Generating word-cloud images (a.k.a. text clouds) with WordCloud and jieba

The go-to Python library for word clouds is 「wordcloud」; install it with pip install wordcloud. By default wordcloud is geared toward English text, so to build a word cloud from Chinese text you first have to segment the Chinese, which is where the Chinese word-segmentation library 「jieba」 comes in; install it with pip install jieba. # Purpose: generate word ......
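
Putting the two libraries together, a minimal sketch of a Chinese word cloud (the input file and the font path are placeholders; wordcloud needs a font that contains Chinese glyphs):

```python
import jieba
import wordcloud

TEXT_PATH = "红楼梦.txt"   # placeholder input text
FONT_PATH = "simhei.ttf"  # placeholder path to a Chinese-capable font

with open(TEXT_PATH, "r", encoding="utf-8") as f:
    txt = f.read()

# wordcloud splits its input on spaces, so join the jieba tokens with spaces first.
segmented = " ".join(jieba.lcut(txt))

w = wordcloud.WordCloud(font_path=FONT_PATH, width=1000, height=700,
                        background_color="white")
w.generate(segmented)
w.to_file("wordcloud.png")
```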

The python-jieba_fast segmenter

jieba is a Chinese word segmenter that splits Chinese text into individual words. To use the jieba segmenter you first have to install the library, which can be done with the following command:
```python
pip install jieba_fast
```
If it will not install, first go to https://visualstudio.microsoft.co ......
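
jieba_fast is generally described as an API-compatible, C-accelerated fork of jieba; a small sketch under that assumption, falling back to plain jieba when the fork is not installed:

```python
# Assumes jieba_fast mirrors jieba's API; plain jieba behaves identically here, just slower.
try:
    import jieba_fast as jieba
except ImportError:
    import jieba

print(jieba.lcut("我来到北京清华大学"))                # precise mode
print(jieba.lcut("我来到北京清华大学", cut_all=True))  # full mode
```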

Using jieba in Python to segment text and write the output to a txt file

# 1. Prepare a raw txt file
![](https://img2023.cnblogs.com/blog/2808014/202308/2808014-20230826101751320-2143868823.png)
# 2. Install the jieba package in PyCharm
![](https://img ......
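
A minimal sketch of the segment-and-save step this post walks through (input and output file names are placeholders):

```python
import jieba

with open("input.txt", "r", encoding="utf-8") as f:   # placeholder input file
    text = f.read()

words = jieba.lcut(text)

with open("output.txt", "w", encoding="utf-8") as f:  # placeholder output file
    f.write(" ".join(words))                          # space-separated tokens on one line
```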

Implementing Chinese word-segmentation matching with Jieba.NET's PosSegmenter in .NET

Table of contents: Introduction · 1. What is Chinese word segmentation · 2. A brief look at Jieba.NET · 3. Introducing PosSegmenter · 4. Implementing Chinese word-segmentation matching · 4.1 Installing the Jieba.NET library · 4.2 Creating a PosSegmenter instance · 4.3 Segmentation and part-of-speech tagging · 4.4 Chinese word-segmentation matching · 5. Summary
Introduction: In natural language processing, Chinese word segmentation is a ......
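
This last post uses the C# Jieba.NET API; for comparison only, the analogous segmentation-plus-POS-tagging step in the Python jieba library (not the API the post itself uses) looks like this:

```python
import jieba.posseg as pseg

# Each item unpacks to (word, part-of-speech flag), e.g. 我/r, 爱/v, 北京/ns.
for word, flag in pseg.cut("我爱北京天安门"):
    print(f"{word}/{flag}")
```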