I had to do all those steps you talked about, specially at bootstrap I run a Bash script stored at s3 like this:
--core-key-value, giraph.zkList=localhost:2181, --mapred-key-value, mapreduce.job.counters.limit=1200
Then at the steps configuration I start by setting up Giraph and Zookeeper by calling two Bash scripts (two separate steps):
In the case of the install_giraph.sh I do this:
hadoop dfs -copyToLocal s3://mybucket/giraph.tar.gz /home/hadoop
tar -xzvf /home/hadoop/giraph.tar.gz -C /home/hadoop
and install_zookeeper.sh does this:
hadoop dfs -copyToLocal s3://data.clipesebandas/binaries/zookeeper.tar.gz /home/hadoop
tar -xzvf /home/hadoop/zookeeper.tar.gz -C /home/hadoop
And finally I run my Giraph algorithm in another step like this:
/home/hadoop/giraph.jar org.giraph.MyGraphAlgorithm /user/hadoop/input_graph, /user/hadoop/built_graph 20 1
Perhaps some steps, like Zookeeper configuration, are not needed since this configuration is based on Giraph 0.1.
Hope this helps.