Skip to content

Instantly share code, notes, and snippets.

@bugcy013
Created November 12, 2012 11:38
Show Gist options
  • Save bugcy013/4058872 to your computer and use it in GitHub Desktop.
Save bugcy013/4058872 to your computer and use it in GitHub Desktop.
RHadoop
Problem
--------
{1} java.io.IOException: Cannot run program "Rscript": java.io.IOException: error=2, No such file or directory
lrwxrwxrwx 1 root root 25 Oct 23 06:48 R -> /usr/local/R-2.15.1/bin/R
lrwxrwxrwx 1 root root 31 Oct 23 06:48 Rscript -> /usr/local/R-2.15.1/bin/Rscript
root@dvcliftonhera123:/usr/bin# pwd
/usr/bin
{2} HADOOP_CMD Not Defained
root@dvcliftonhera123:~# cat /etc/environment
PATH="/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games"
HADOOP_CMD="/usr/bin/hadoop"
{3} Sudo user Environemnt variable creation
root@ubuntu-VirtualBox:~# cat /etc/sudoers
#
# This file MUST be edited with the 'visudo' command as root.
#
# Please consider adding local content in /etc/sudoers.d/ instead of
# directly modifying this file.
#
# See the man page for details on how to write a sudoers file.
#
Defaults env_reset
Defaults secure_path="/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin"
Defaults env_keep += "JAVA_HOME"
Defaults env_keep += "HADOOP_CMD"
Defaults env_keep += "HADOOP_STREAMING"
Defaults env_keep += "HADOOP_HOME
{4} Default Environement Valriable for RHadoop For Hera Cluster
(/etc/bash.basrc)
export HADOOP_HOME=/usr/lib/hadoop
export HADOOP_CMD=/usr/bin/hadoop
export R_HOME=/usr/local/R-2.15.1
export PATH=$R_HOME/bin:$PATH
export JAVA_HOME=/app_packages/jdk1.6.0_30
export R_LIBS=/usr/local/R-2.15.1/library
export PATH=$JAVA_HOME/bin:$PATH
(~/.bashrc)
export JAVA_HOME=/app_packages/jdk1.6.0_30
export PATH=/usr/local/R-2.15.1/bin:$JAVA_HOME/bin:$PATH
export HADOOP_STREAMING=/usr/lib/hadoop-0.20-mapreduce/contrib/streaming/hadoop-streaming-2.0.0-mr1-cdh4.0.1.jar
export R_HOME=/usr/local/R-2.15.1
{5} R -package
wget http://cran.r-project.org/src/base/R-2/R-2.15.1.tar.gz
For ubutnu machine You need install dep. packages
apt-get build-dep r-base
{6} R-Hadoop Releated packaegs install
Download the required packages for RHadoop from - https://github.com/jseidman/pkgs
Install the downloaded packages -
sudo R CMD INSTALL --library=/usr/local/R-2.15/library jseidman-pkgs-8e3d543/rpm/src/digest/SOURCES/digest_0.5.1.tar.gz
sudo R CMD INSTALL --library=/usr/local/R-2.15/library jseidman-pkgs-8e3d543/rpm/src/iterators-rpm/SOURCES/iterators_1.0.5.tar.gz
sudo R CMD INSTALL --library=/usr/local/R-2.15/library jseidman-pkgs-8e3d543/rpm/src/itertools-rpm/SOURCES/itertools_0.1-1.tar.gz
sudo R CMD INSTALL --library=/usr/local/R-2.15/library jseidman-pkgs-8e3d543/rpm/src/rjsonio-rpm/SOURCES/RJSONIO_0.96-0.tar.gz
RHadoop pack.
https://github.com/RevolutionAnalytics/RHadoop/downloads
R packages install
wget https://github.com/downloads/RevolutionAnalytics/RHadoop/rmr2_2.0.0.tar.gz
wget https://github.com/downloads/RevolutionAnalytics/RHadoop/quickcheck_1.0.tar.gz (Dependency for rmr 2.0 )
wget https://github.com/downloads/RevolutionAnalytics/RHadoop/rhdfs_1.0.5.tar.gz
install.packages('Rcpp')
rJava
functional
xts
TTR
PerformanceAnalytics
sendmailR
{7} Error in library(rmr2) : there is no package called ‘rmr2’ [[Big Problem understand]]
# cd /usr/local/lib
# mv R R.1
# ln -s /usr/local/R-2.15.1 R
ubuntu@ubuntu-VirtualBox:/usr/local/lib$ ls -ltr
total 8
drwxrwsr-x 4 root staff 4096 Apr 25 2012 python2.7
drwxr-xr-x 10 root root 4096 Oct 19 14:59 R.1
lrwxrwxrwx 1 root root 19 Oct 25 17:30 R -> /usr/local/R-2.15.1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment