What I have been up to.
Its been a very rough ride so far. Have been busy with my thesis and had to spend the whole spring break doing some coding. No life you may, well, that's the way it is. I also watched quite a few movies in theaters and at home.
I have been doing a write on the first 3 chapters of my thesis. I have also implemented the Naive Bayes Classifier and after I have completely tested it, I am going to implement the mapreduce version.
I have been very happy about my progress in mastering mapreduce. Its almost like learning a new programming language : constantly looking up syntax, etc. Although I had written some mapreduce programs (mostly simple programs similar to the examples that accompany Hadoop), the real lessons came from my implementation of the kmeans algorithm.
One of the main problems I had was to figure out how to read a file. I am one of those people who learn by example and I had not seen any example that read a file. It took me a lot of digging aroung to figure out that this can by done through the config method of the mapper. There was also some other useful things I picked up especially how to write an iterative program.
Wednesday, March 25, 2009 | 0 Comments
My Blog List
-
SXSW: Is Privacy on the Social Web a Technical Problem? - How to deal with user privacy on social networks as they grow, mature and become more sophisticated has been a frequent topic of conversation at this year'...3 hours ago
-
The Onion on Google's data - The Onion has a hilarious article, "Google Responds To Privacy Concerns With Unsettlingly Specific Apology", that should be enjoyable for this crowd. An ex...2 days ago
-
Why Europe’s Largest Ad Targeting Platform Uses Hadoop - Richard Hutton, CTO of nugg.ad, authored the following post about how and why his company uses Hadoop. nugg.ad operates Europe’s largest targeting platform...3 days ago
-
I might not see tomorrow... - Thoughts to paper...Random thoughts Listen, I might be gone by tomorrow so give me a chance Allow me to tell you my thoughts Before the end of my time My w...1 week ago
-
Del.icio.us Python API - One of my recent research tasks required me to retrieve various information from Delicious.com, a well-known social bookmarking service. My programming l...1 week ago
-
Search Engine Basics - Receive the question of "how search works ?" couple times recently so try to document the whole process. This is intended to highlight the key concepts but...1 week ago
-
New threadpool design - In MySQL 6.0 a threadpool design was implemented based on libevents and mutexes. This design unfortunately had a number of deficiences: 1) The performance u...3 months ago
-
Are you ready for the judgment? - By Roy Davison. God is "the Judge of all the earth" (Genesis 18:25). "The LORD shall judge the peoples" (Psalm 7:8 // Hebrews 10:30). "God shall judge the ...3 months ago
-
Suarez’s The Daemon - Finished reading Daniel Suarez’s The Daemon, in between getting grants and writing papers and such, this semester. This is maybe the best book I have rea...9 months ago
