Hadoop ships with a utility called Hadoop Streaming that lets you write MapReduce jobs in languages other than Java. You can use Ruby, Perl, or Python, or even put together a quick MapReduce job as a shell script.
The following Python script, mapper.py, takes a text file as input and tokenizes it into <key, value> pairs. The key is the number of characters in each word, and the value is the word itself.
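The excerpt above describes mapper.py without showing it, so here is a minimal sketch of what such a mapper might look like. It assumes words are separated by whitespace and that the default Hadoop Streaming convention applies (everything before the first tab on an output line is the key). The helper names `map_line` and `main` are illustrative and not taken from the original post.

```python
#!/usr/bin/env python3
"""mapper.py: sketch of a Hadoop Streaming mapper keyed on word length."""
import sys


def map_line(line):
    """Yield (length, word) pairs for each whitespace-separated word."""
    for word in line.split():
        yield len(word), word


def main(stream=sys.stdin, out=sys.stdout):
    """Read lines from stream and write one 'length<TAB>word' line per word."""
    for line in stream:
        for length, word in map_line(line):
            # Hadoop Streaming treats the text before the first tab as the key.
            out.write(f"{length}\t{word}\n")


if __name__ == "__main__":
    main()
```

Because a streaming mapper only reads standard input and writes standard output, it can be tried locally before submitting a job, for example by piping a text file into `python3 mapper.py` and then through `sort` to mimic the shuffle phase.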
Fixing the external USB drive error "Drive not accessible. The parameter is incorrect"
How to make the Namenode leave safemode?
The following commands report the file system type of a drive on Linux:
# df -T
# mount
# cat /etc/fstab
# fsck -N /dev/sdc1
# sudo file -s /dev/sda1
# sudo blkid
# sudo lsblk -f
Networking bandwidth testing
Networking: What's the VLAN ID?
# for all subnets / tcp protocol
#iptables -A INPUT -m state --state NEW -m tcp -p tcp -s 192.168.122.0/24 --match multiport --dports 22,25,123,53
# for subnet 192.168.122.0/24 / tcp protocol
#iptables -A INPUT -m state --state NEW -m tcp -p tcp -s 192.168.122.0/24 --match multiport --dports 22,25,123,53
Note:...
Finding the top ten processes on Linux by memory usage.
Here are the steps I followed to fix the timezone sync issue on CentOS 6 Linux: