hadoop - Pig job gets killed on Amazon EMR. -


i have been trying run pig job multiple steps on amazon emr. here details of environment:

number of nodes: 20 ami version: 3.1.0 hadoop distribution: 2.4.0

the pig script has multiple steps , spawns long-running map reduce job has both map phase , reduce phase. after running sometime (sometimes hour, 3 or four), job killed. information on resource manager job is:

kill job received hadoop (auth:simple) @ job received kill while in running state.

obviously, did not kill :)

my question is: how go trying identify happened? how diagnose issue? log files @ (what grep for)? on appropriate log files helpful. new yarn/hadoop 2.0

there can number of reasons. enable debugging on cluster , see in stderr logs more information.

aws emr create-cluster --name "test cluster" --ami-version 3.9 --log-uri s3://mybucket/logs/ \ --enable-debugging --applications name=hue name=hive name=pig 

more details here: http://docs.aws.amazon.com/elasticmapreduce/latest/developerguide/emr-plan-debugging.html


Comments

Popular posts from this blog

toolbar - How to add link to user registration inside toobar in admin joomla 3 custom component -

linux - disk space limitation when creating war file -