hadoop - Pig job gets killed on Amazon EMR
I have been trying to run a Pig job with multiple steps on Amazon EMR. Here are the details of my environment:
Number of nodes: 20, AMI version: 3.1.0, Hadoop distribution: 2.4.0
The Pig script has multiple steps and spawns a long-running MapReduce job that has both a map phase and a reduce phase. After running for some time (sometimes an hour, sometimes three or four), the job is killed. The information shown on the ResourceManager for the job is:
Kill job received from hadoop (auth:SIMPLE) at [...] Job received Kill while in RUNNING state.
Obviously, I did not kill it :)
My question is: how do I go about identifying what happened? How do I diagnose the issue? Which log files should I look at (and what should I grep for)? Any pointers to the appropriate log files would be helpful. I am new to YARN/Hadoop 2.0.
There can be a number of reasons. Enable debugging on the cluster and look in the stderr logs for more information.
aws emr create-cluster --name "test cluster" --ami-version 3.9 --log-uri s3://mybucket/logs/ \
  --enable-debugging --applications Name=Hue Name=Hive Name=Pig
More details here: http://docs.aws.amazon.com/elasticmapreduce/latest/developerguide/emr-plan-debugging.html
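Once logs are being written to the log URI, one way to approach the "what to grep for" part is to copy them locally and search for the kill messages quoted in the question. The following is only a rough sketch: `find_kill_events` is a hypothetical helper name, `./emr-logs` is an assumed local directory, and the bucket is the placeholder from the command above. It also assumes the log files have been decompressed first, since plain `grep` does not read `.gz` files.

```shell
#!/usr/bin/env bash
# Sketch: search locally downloaded EMR/YARN logs for kill-related messages.
# find_kill_events is a hypothetical helper, not part of Hadoop or the AWS CLI.
find_kill_events() {
  local log_dir="$1"
  # -r: recurse into subdirectories, -i: ignore case,
  # -n: print line numbers, -E: extended regular expressions.
  # The patterns are based on the messages quoted in the question.
  grep -rinE 'kill job.*received|received kill while in' "$log_dir"
}

# Example usage (assumed paths; adjust to your own bucket and directory):
#   aws s3 sync s3://mybucket/logs/ ./emr-logs/
#   gunzip -r ./emr-logs        # decompress any .gz log files in place
#   find_kill_events ./emr-logs
```

Each match is printed with its file name and line number, which should help narrow down which daemon log recorded the kill request and what was logged just before it.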