Ahmed Kareem


Cloud computing Recommender data mahout

Student: Ahmed Kareem Oleiwi 2015220080

An algorithm library for scalable machine learning on Hadoop.

Apache Mahout is a library of scalable machine-learning algorithms, implemented on top of Apache Hadoop and using the MapReduce paradigm. Machine learning is a discipline of artificial intelligence focused on enabling machines to learn without being explicitly programmed, and it is commonly used to improve future performance based on previous outcomes.

Once big data is stored on the Hadoop Distributed File System (HDFS), Mahout provides the data science tools to automatically find meaningful patterns in that data.

What is Mahout

The problem and solution

I successfully installed a Hadoop cluster with 3 machines, and the cluster is running fine. I then installed Mahout on the main name node for "testing purposes", followed the installation instructions, and set JAVA_HOME. However, when I run classify-20newsgroups.sh, it downloads the dataset, but after that I get the following error:

To solve this problem, I first revised .bashrc and confirmed that JAVA_HOME is set correctly.

The .bashrc file is read only by interactive non-login shells; a login shell reads .bash_profile instead.
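One common way to make login shells pick up the same settings is to have .bash_profile load .bashrc. The fragment below is a minimal sketch of that idiom, assuming bash is the login shell and that both files live in the home directory; it is not taken from the original report.

```shell
# Place in ~/.bash_profile (assumption: bash is the login shell).
# If ~/.bashrc exists, load it so login shells see the same
# environment variables (such as JAVA_HOME) as non-login shells.
if [ -f "$HOME/.bashrc" ]; then
  . "$HOME/.bashrc"
fi
```

With this in place, a variable exported in .bashrc should be visible both in a new terminal window and in an SSH login session.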

There are several other ways to set JAVA_HOME: 1) set it in .bashrc from the terminal
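Option 1 could look roughly like the sketch below. It defines a small helper, add_java_home (a hypothetical name introduced here for illustration), that appends the export lines to a given startup file only if JAVA_HOME is not already exported there. The JDK path shown in the usage note is an assumption and has to be replaced with the real JDK location on the machine.

```shell
# add_java_home RC_FILE JDK_DIR
# Hypothetical helper: appends JAVA_HOME and PATH exports to RC_FILE,
# unless RC_FILE already contains an "export JAVA_HOME=" line.
add_java_home() {
  rc_file="$1"
  jdk_dir="$2"

  # Skip the append when an export is already present, so repeated
  # calls do not duplicate the lines.
  if ! grep -q '^export JAVA_HOME=' "$rc_file" 2>/dev/null; then
    {
      echo "export JAVA_HOME=\"$jdk_dir\""
      # Single quotes keep $JAVA_HOME and $PATH unexpanded in the file,
      # so they are resolved each time the file is read.
      echo 'export PATH="$JAVA_HOME/bin:$PATH"'
    } >> "$rc_file"
  fi
}
```

A possible call is `add_java_home "$HOME/.bashrc" "/usr/lib/jvm/java-8-openjdk-amd64"` (assumed path), followed by `. "$HOME/.bashrc"` to load the change into the current terminal and `echo "$JAVA_HOME"` to confirm the value before rerunning classify-20newsgroups.sh.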
