hadoop - Spark 1.4 missing Kafka libraries
I'm trying to run a Python Spark script that works in Spark 1.3.1. I have downloaded Spark 1.4 and tried running the script, but it keeps failing, saying:
Spark Streaming's Kafka libraries not found in class path. Try one of the following.

1. Include the Kafka library and its dependencies in the spark-submit command as

   $ bin/spark-submit --packages org.apache.spark:spark-streaming-kafka:1.4.0 ...

2. Download the JAR of the artifact from Maven Central http://search.maven.org/, Group Id = org.apache.spark, Artifact Id = spark-streaming-kafka-assembly, Version = 1.4.0. Then, include the jar in the spark-submit command as

   $ bin/spark-submit --jars <spark-streaming-kafka-assembly.jar> ...
I have explicitly referenced the jars in the submit command and added the jars as

/opt/spark/spark-1.4.0-bin-hadoop2.6/bin/spark-submit --jars spark-streaming_2.10-1.4.0.jar,spark-core_2.10-1.4.0.jar,spark-streaming-kafka-assembly_2.10-1.4.0.jar,kafka_2.10-0.8.2.1.jar,kafka-clients-0.8.2.1.jar,spark-streaming-kafka-assembly_2.10-1.4.0.jar /root/sparkpysqlnew.py

It says it has added them when the application kicks off, so why is it not finding them?
15/07/08 05:44:37 INFO spark.SparkContext: Added JAR file:/root/spark-streaming_2.10-1.4.0.jar at http://192.168.134.138:49637/jars/spark-streaming_2.10-1.4.0.jar with timestamp 1436334277792
15/07/08 05:44:37 INFO spark.SparkContext: Added JAR file:/root/spark-core_2.10-1.4.0.jar at http://192.168.134.138:49637/jars/spark-core_2.10-1.4.0.jar with timestamp 1436334277919
15/07/08 05:44:38 INFO spark.SparkContext: Added JAR file:/root/spark-streaming-kafka-assembly_2.10-1.4.0.jar at http://192.168.134.138:49637/jars/spark-streaming-kafka-assembly_2.10-1.4.0.jar with timestamp 1436334278295
15/07/08 05:44:38 INFO spark.SparkContext: Added JAR file:/root/kafka_2.10-0.8.2.1.jar at http://192.168.134.138:49637/jars/kafka_2.10-0.8.2.1.jar with timestamp 1436334278353
15/07/08 05:44:38 INFO spark.SparkContext: Added JAR file:/root/kafka-clients-0.8.2.1.jar at http://192.168.134.138:49637/jars/kafka-clients-0.8.2.1.jar with timestamp 1436334278357
15/07/08 05:44:38 INFO spark.SparkContext: Added JAR file:/root/spark-streaming-kafka-assembly_2.10-1.4.0.jar at http://192.168.134.138:49637/jars/spark-streaming-kafka-assembly_2.10-1.4.0.jar with timestamp 1436334278665
15/07/08 05:44:38 INFO spark.SparkContext: Added JAR file:/root/spark-streaming-kafka-assembly_2.10-1.4.0-sources.jar at http://192.168.134.138:49637/jars/spark-streaming-kafka-assembly_2.10-1.4.0-sources.jar with timestamp 1436334278666

I know I have added in loads of them. I started off with one and ended up adding them all in the end.
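For anyone comparing against the command above: the assembly jar appears twice in the --jars list. Below is a rough sketch, assuming Python 3.7+, of building the same spark-submit argument list with duplicates dropped. `build_submit_command` is a hypothetical helper written for this post, not part of Spark, and removing the duplicate is only a tidy-up; it is not confirmed to resolve the class path error on 1.4.0.

```python
import shlex


def build_submit_command(spark_submit, script, jars):
    """Build a spark-submit argv, dropping duplicate jars but keeping their order.

    Hypothetical helper for illustration only; it is not a Spark API.
    """
    unique_jars = list(dict.fromkeys(jars))  # dicts preserve insertion order (3.7+)
    return [spark_submit, "--jars", ",".join(unique_jars), script]


# Jar list copied from the command in the question (assembly jar listed twice).
jars = [
    "spark-streaming_2.10-1.4.0.jar",
    "spark-core_2.10-1.4.0.jar",
    "spark-streaming-kafka-assembly_2.10-1.4.0.jar",
    "kafka_2.10-0.8.2.1.jar",
    "kafka-clients-0.8.2.1.jar",
    "spark-streaming-kafka-assembly_2.10-1.4.0.jar",
]

cmd = build_submit_command(
    "/opt/spark/spark-1.4.0-bin-hadoop2.6/bin/spark-submit",
    "/root/sparkpysqlnew.py",
    jars,
)

# Print the command in a form that can be pasted into a shell.
print(" ".join(shlex.quote(part) for part in cmd))
```

The result is a plain list, so it could be passed to `subprocess.run(cmd)` instead of being printed, which avoids shell quoting issues.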