So we need to do a lot of string manipulation to strip, clean, and filter the string values. I found that the following class has handy methods for most of the operations we need:
http://ws.apache.org/axis/java/apiDocs/org/apache/axis/utils/StringUtils.html
For example, it provides stripStart and stripEnd.
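To give a feel for what these two methods do, here is a minimal plain-Java sketch. It is not the Axis source: StripSketch and its helpers are hypothetical stand-ins, written on the assumption that stripStart and stripEnd remove any of the given characters from the start or end of a string, and treat a null character set as "strip whitespace". The sample input is made up for illustration.

```java
public class StripSketch {

    // Hypothetical stand-in for stripStart: drops leading characters that
    // appear in stripChars; a null stripChars means "strip whitespace".
    public static String stripStart(String str, String stripChars) {
        if (str == null) {
            return null;
        }
        int start = 0;
        while (start < str.length() && shouldStrip(str.charAt(start), stripChars)) {
            start++;
        }
        return str.substring(start);
    }

    // Hypothetical stand-in for stripEnd: same idea, working from the end.
    public static String stripEnd(String str, String stripChars) {
        if (str == null) {
            return null;
        }
        int end = str.length();
        while (end > 0 && shouldStrip(str.charAt(end - 1), stripChars)) {
            end--;
        }
        return str.substring(0, end);
    }

    // True if c is one of the characters to strip (whitespace when stripChars is null).
    private static boolean shouldStrip(char c, String stripChars) {
        return stripChars == null ? Character.isWhitespace(c) : stripChars.indexOf(c) >= 0;
    }

    public static void main(String[] args) {
        // Made-up raw field: leading spaces and a quote, trailing quote and commas.
        String raw = "  \"United States\",,";
        String cleaned = stripEnd(stripStart(raw, " \""), ",\"");
        System.out.println(cleaned); // prints: United States
    }
}
```

In a real mapper you would call the library's own methods on each field instead of these stand-ins. The chaining pattern (strip the start, then the end) stays the same.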
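When you use this library in your MapReduce program, the program will look for its jar at run time. You have two options:
- Put the jar in the lib directory of every node in the cluster (not feasible in most cases).
- Ship the jar with the job to the nodes that run your tasks, using the -libjars option.
For the second option, compile against the jar and then pass it with -libjars when you submit the job. Note that -libjars is only honored when the driver parses Hadoop's generic options, for example by running through ToolRunner: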
javac -classpath /apache/hadoop/hadoop-core-0.20.security-wilma-14.jar:/home/invidx/axis.jar wc.java
hadoop jar wc.jar wc -libjars /home/invidx/axis.jar /apps/traffic/learn/countries.seq /apps/traffic/outp/