目录

Spark编程使用Spark-SQL-连接数据库-2020.11.01

Spark编程:使用Spark SQL 连接数据库 2020.11.01

文章目录

1.基础工作

1.安装mysql

sudo apt-get update
sudo apt-get install mysql-server

https://i-blog.csdnimg.cn/blog_migrate/35572c5e73d34a273ff312b938188f3a.png#pic_center

2.启动和关闭mysql服务器

service mysql start
service mysql stop

https://i-blog.csdnimg.cn/blog_migrate/9703725d2be8b40ffcf947f8dc945a86.png#pic_center

3.进入mysql界面(若密码问题请参考以下链接)

若因为密码问题,一般是安装mysql时候没有设置密码导致,或者是忘记密码:

	$ mysql -u root -p
	#屏幕会提示你输入密码

2.创建数据库和表

mysql> create database spark;
mysql> use spark;
mysql> create table student (id int(4), name char(20), gender char(4), age int(4));
mysql> insert into student values(1,'Xueqian','F',23);
mysql> insert into student values(2,'Weiliang','M',24);
mysql> select * from student;

https://i-blog.csdnimg.cn/blog_migrate/438d2a9335a15c3a874e7a73c7fae358.png#pic_center

3.spark进行数据库连接

3.1 进入spark时写附加命令(前提是记得建所需要的包进行解压到spark的jars目录下)

./bin/spark-shell  \
 --jars /usr/local/spark/jars/mysql-connector-java-5.1.40/mysql-connector-java-5.1.40-bin.jar \
 --driver-class-path /usr/local/spark/jars/mysql-connector-java-5.1.40/mysql-connector-java-5.1.40-bin.jar

https://i-blog.csdnimg.cn/blog_migrate/d37b4844a1406c2e18d097f42ddf5cab.png#pic_center

3.2 进行连接

注意修改成自己的密码

val jdbcDF = spark.read.format("jdbc").
option("url","jdbc:mysql://localhost:3306/spark").
option("driver","com.mysql.jdbc.Driver").
option("dbtable", "student").
option("user", "root").
option("password", "210374520").
load()

https://i-blog.csdnimg.cn/blog_migrate/dc3e1815be97fd333fe35d2bc58ba17f.png#pic_center

3.3 连接测试

jdbcDF.show()

https://i-blog.csdnimg.cn/blog_migrate/fcacf847015817b48c964e6148f90876.png#pic_center