The reduce copier failed!
I spend most of this weekend trying to figure out what was wrong with either my Hadoop cluster (running version 0.18.2) or one of the jobs that I was submitting. I kept getting a "reduce copier failed". I suspected that this might have to do with the mapred.child.java.opts but changing it to -Xmx512M did not help. I tried googling to see if there was anybody with a similar problem. Well as it turned out, I once had a similar problem although at that time it was just a development pseudo-distributed install running on my laptop. Using conf.set("mapred.job.tracker", "local") did the trick. So I assumed removing this option when I was ready to run the program in a multi-node cluster would work. Unfortunately it did not. Other jobs including the example programs executed without any problems. This particular job however processes huge amounts of data (typical Hadoop jobs I know) and so I thought this could have been the cause of the problem.
After a very long time and I don't know how many searches in Google and how many documents I read, I did not find a solution. There were several references to the "reduce copier failed" problem but it seemed mine was a little bit different. At some point i thought this could have been an issue that was resolved in newer version but I still could not find any evidence that someone else had this problem before so I was determined to get under the hood, play around with the settings and make it work. I failed. So I reluctantly upgraded to 0.19.1 (I don't remember why I did not do this in the first place) and boom! It worked.
So "reduce copier failed" was resolved by upgrading to 0.19.1.
My Blog List
-
SXSW: Is Privacy on the Social Web a Technical Problem? - How to deal with user privacy on social networks as they grow, mature and become more sophisticated has been a frequent topic of conversation at this year'...3 hours ago
-
The Onion on Google's data - The Onion has a hilarious article, "Google Responds To Privacy Concerns With Unsettlingly Specific Apology", that should be enjoyable for this crowd. An ex...2 days ago
-
Why Europe’s Largest Ad Targeting Platform Uses Hadoop - Richard Hutton, CTO of nugg.ad, authored the following post about how and why his company uses Hadoop. nugg.ad operates Europe’s largest targeting platform...3 days ago
-
I might not see tomorrow... - Thoughts to paper...Random thoughts Listen, I might be gone by tomorrow so give me a chance Allow me to tell you my thoughts Before the end of my time My w...1 week ago
-
Del.icio.us Python API - One of my recent research tasks required me to retrieve various information from Delicious.com, a well-known social bookmarking service. My programming l...1 week ago
-
Search Engine Basics - Receive the question of "how search works ?" couple times recently so try to document the whole process. This is intended to highlight the key concepts but...1 week ago
-
New threadpool design - In MySQL 6.0 a threadpool design was implemented based on libevents and mutexes. This design unfortunately had a number of deficiences: 1) The performance u...3 months ago
-
Are you ready for the judgment? - By Roy Davison. God is "the Judge of all the earth" (Genesis 18:25). "The LORD shall judge the peoples" (Psalm 7:8 // Hebrews 10:30). "God shall judge the ...3 months ago
-
Suarez’s The Daemon - Finished reading Daniel Suarez’s The Daemon, in between getting grants and writing papers and such, this semester. This is maybe the best book I have rea...9 months ago

Post a Comment