安装Spark与Python练习

2022/3/6 14:51:30

本文主要是介绍安装Spark与Python练习,对大家解决编程问题具有一定的参考价值,需要的程序猿们随着小编来一起学习吧!

一、安装Spark

  1. 检查基础环境hadoop,jdk

     

     

 

 

2.下载spark

 

 

 

 二、Python编程练习:英文文本的词频统计

1、准备文本(f1.txt)

Please send this message to those people who mean something to you,to those who have touched your life in one way or another,to those who make you smile when you really need it,to those that make you see the brighter side of things when you are really down,to those who you want to let them know that you appreciate their friendship.And if you don’t, don’t worry,nothing bad will happen to you,you will just miss out on the opportunity to brighten someone’s day with this message.

 2、插入代码

复制代码
path='/home/hadoop/sb/f1.txt'
with open(path) as f:
    text=f.read()
words = text.split()
sb={}
for word in words:
    sb[word]=sb.get(word,0)+1
sblist=list(sb.items())
sblist.sort(key=lambda x:x[1],reverse=True)
print(sblist)
复制代码

3、输出结果

 



这篇关于安装Spark与Python练习的文章就介绍到这儿,希望我们推荐的文章对大家有所帮助,也希望大家多多支持为之网!


扫一扫关注最新编程教程