python读取文档列表数据为列表并统计某一项数据制成邻接矩阵
2021/11/16 9:11:10
本文主要是介绍python读取文档列表数据为列表并统计某一项数据制成邻接矩阵,对大家解决编程问题具有一定的参考价值,需要的程序猿们随着小编来一起学习吧!
python读取文档列表数据为列表并统计某一项数据制成邻接矩阵
文章目录
- python读取文档列表数据为列表并统计某一项数据制成邻接矩阵
import json from pandas import DataFrame import json import numpy as np file = open('db.txt', encoding='UTF-8') js = file.read() dic = json.loads(js) #print(dic) file.close() authors=[] authors_id={} empty=[] author_name={} maxtrix = np.zeros((100, 100)) #第一个循环代表遍历每一篇论文 for leng_dic in range(len(dic)): author_dic=dic[leng_dic].get('authors', []) if author_dic is None: empty.append(leng_dic) else: author_now = [] for i in range( len(author_dic)): author_name[author_dic[i]['_id']] = author_dic[i]['name'] ##给每一个作者制定编号 if author_dic[i]['_id'] not in authors: authors.append(author_dic[i]['_id']) authors_id[author_dic[i]['_id']]=len(authors) #print(authors_id) else: authors_id[author_dic[i]['_id']]=authors_id[author_dic[i]['_id']] author_now.append(author_dic[i]['_id']) ##依据编号,给当前遍历的论文作者制定邻接矩阵 for author_id_now in author_now: for author_id_now1 in author_now: a=authors_id[author_id_now] b=authors_id[author_id_now1] maxtrix[a][b]=maxtrix[a ][b ]+1 maxtrix[b][a] = maxtrix[b][a] + 1 print(maxtrix) data_frame = DataFrame(data=maxtrix) data_frame.to_csv('maxtrix.csv') print(authors_id) print(author_name) print(len(authors_id)) print(len(author_name))
这篇关于python读取文档列表数据为列表并统计某一项数据制成邻接矩阵的文章就介绍到这儿,希望我们推荐的文章对大家有所帮助,也希望大家多多支持为之网!
- 2024-12-27Python编程入门指南
- 2024-12-27Python编程基础
- 2024-12-27Python编程基础教程
- 2024-12-27Python编程基础指南
- 2024-12-24Python编程入门指南
- 2024-12-24Python编程基础入门
- 2024-12-24Python编程基础:变量与数据类型
- 2024-12-23使用python部署一个usdt合约,部署自己的usdt稳定币
- 2024-12-20Python编程入门指南
- 2024-12-20Python编程基础与进阶