# DAY 1
1: Creating a directory in HDFS
A:
(base) administrator@master:~/Desktop/workspace$ hdfs dfs -mkdir /home/my_hdfs
mkdir: 'hdfs://master:9000/home': No such file or directory
B:
(base) administrator@master:~/Desktop/workspace$ hdfs dfs -mkdir /my_hdfs
(base) administrator@master:/$ hdfs dfs -mkdir /my_hdfs/test_mkdir
C:
(base) administrator@master:/$ hdfs dfs -mkdir /my_hdfs/test_mkdir
mkdir: '/my_hdfs/test_mkdir': File exists
D:
(base) administrator@master:/$ hdfs dfs -mkdir /my_hdfs/test_mkdir_2/test
mkdir: 'hdfs://master:9000/my_hdfs/test_mkdir_2': No such file or directory
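Cases C and D above fail for opposite reasons: in C the directory already exists, and in D the parent directory is missing. The `-p` flag of `-mkdir` covers both situations. The sketch below wraps it in a small helper; the function name `ensure_hdfs_dir` is made up for illustration and is not part of Hadoop.

```shell
# Hypothetical helper (not a Hadoop command): create an HDFS directory
# together with any missing parents. With -p, -mkdir does not report an
# error when the directory already exists.
ensure_hdfs_dir() {
    local dir="$1"
    if [ -z "$dir" ]; then
        echo "usage: ensure_hdfs_dir <hdfs-path>" >&2
        return 2
    fi
    hdfs dfs -mkdir -p "$dir"
}

# Example call (needs a running cluster):
#   ensure_hdfs_dir /my_hdfs/test_mkdir_2/test
```

Running the example call twice should succeed both times, unlike the plain `-mkdir` in case C.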
2: Hadoop HDFS put Command Usage
hdfs dfs -put <localSrc> <dest>
3: Hadoop HDFS copyFromLocal Command Usage
hdfs dfs -copyFromLocal <localSrc> <dest>
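For a source on the local filesystem, `-put` and `-copyFromLocal` take the same two arguments. A minimal sketch of an upload helper is shown below; the name `upload_to_hdfs` is hypothetical, and the check on the local file is only there to give a clearer error message before Hadoop is invoked.

```shell
# Hypothetical helper (not a Hadoop command): copy one local file to HDFS.
upload_to_hdfs() {
    local src="$1" dest="$2"
    # Fail early if the local file is missing.
    if [ ! -f "$src" ]; then
        echo "local file not found: $src" >&2
        return 1
    fi
    # For a local source, -copyFromLocal could be used here instead of -put.
    hdfs dfs -put "$src" "$dest"
}

# Example call (needs a running cluster):
#   upload_to_hdfs /home/admin/Desktop/exp_17/Txn_data.csv /my_hdfs
```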
CODE:
A: The copy fails, apparently because of the space in the directory name 'Exp 17'.
(base) administrator@master:/$ hdfs dfs -copyFromLocal '/home/admin/Desktop/Exp 17/Txn_data.csv' /my_hdfs
copyFromLocal: `/home/admin/Desktop/Exp 17/Txn_data.csv': No such file or directory
B:
(base) administrator@master:/$ hdfs dfs -copyFromLocal /home/admin/Desktop/exp_17/Txn_data.csv /my_hdfs
2019-10-16 19:57:13,725 INFO sasl.SaslDataTransferClient: SASL encryption trust check: localHostTrusted = false, remoteHostTrusted = false
C:
(base) administrator@master:/$ hdfs dfs -ls /my_hdfs
Found 2 items
-rw-r--r-- 1 administrator supergroup 4390877 2019-10-16 19:57 /my_hdfs/Txn_data.csv
drwxr-xr-x - administrator supergroup 0 2019-10-16 19:52 /my_hdfs/test_mkdir
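Renaming the directory, as in step B, is one way around the failure in step A. Assuming the space in the local path really is the cause, another option is to let the local shell open the file and stream it to `-put` through standard input, so that Hadoop never has to parse the local path. The helper name `put_via_stdin` is hypothetical.

```shell
# Hypothetical helper (not a Hadoop command): upload a local file by
# streaming it over stdin. The destination must be a full HDFS file path,
# not just a directory, because no source file name is passed along.
put_via_stdin() {
    local src="$1" dest_file="$2"
    if [ ! -f "$src" ]; then
        echo "local file not found: $src" >&2
        return 1
    fi
    # A source of "-" tells -put to read the data from standard input.
    hdfs dfs -put - "$dest_file" < "$src"
}

# Example call (needs a running cluster):
#   put_via_stdin '/home/admin/Desktop/Exp 17/Txn_data.csv' /my_hdfs/Txn_data.csv
```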
# DAY 2
1: Listing the existing files
(base) [admin@MASTER ~]$ hdfs
Usage: hdfs [OPTIONS] SUBCOMMAND [SUBCOMMAND OPTIONS]
$ hdfs dfs -ls
Here:
% 'dfs' is a subcommand of type 'Client Commands' that runs a filesystem command on the file system.
Other examples of subcommands could be 'namenode' (run the DFS namenode) and 'datanode' (run a DFS datanode) as in "hdfs namenode -format".
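% '-ls' is an option of the 'dfs' subcommand that lists the contents of a path. When no path is given, it lists the current user's HDFS home directory (here /user/admin), not every file in HDFS.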
(base) [admin@MASTER ~]$ hdfs dfs -ls
Found 1 items
drwxr-xr-x - admin supergroup 0 2020-04-28 14:16 .sparkStaging
Because no path was given, this entry is in the user's home directory, so its full location is: hdfs://master:9000/user/admin/.sparkStaging
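The rule behind this can be sketched in a few lines of shell: an absolute path is used as-is, while a relative (or empty) path is resolved against the user's home directory. The function name `resolve_hdfs_path` is hypothetical, and the `/user/<username>` home prefix is an assumption based on the default configuration; a cluster can be configured differently.

```shell
# Hypothetical helper: show how a path given to 'hdfs dfs' would be
# resolved, assuming the default home directory /user/<username>.
resolve_hdfs_path() {
    local path="$1" user="${2:-$(whoami)}"
    case "$path" in
        /*) printf '%s\n' "$path" ;;                   # absolute: unchanged
        "") printf '/user/%s\n' "$user" ;;             # no path: home directory
        *)  printf '/user/%s/%s\n' "$user" "$path" ;;  # relative: under home
    esac
}

resolve_hdfs_path .sparkStaging admin   # prints /user/admin/.sparkStaging
resolve_hdfs_path /my_hdfs admin        # prints /my_hdfs
```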
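2: Creating a new directory
Note: /my_hdfs is created at the HDFS root, so the two listings below, which list the home directory because no path is given, do not show it.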
(base) [admin@MASTER ~]$ hdfs dfs -mkdir /my_hdfs
(base) [admin@MASTER ~]$ hdfs dfs -ls
Found 1 items
drwxr-xr-x - admin supergroup 0 2020-04-28 14:16 .sparkStaging
(base) [admin@MASTER ~]$ hdfs dfs -ls -R
drwxr-xr-x - admin supergroup 0 2020-04-28 14:16 .sparkStaging
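Neither listing shows /my_hdfs because `-ls` without a path looks at the home directory, while the new directory sits at the root. To confirm that the directory was created, `hdfs dfs -ls /` can be used, or the path can be tested directly with `-test -d`. The helper names below are hypothetical.

```shell
# Hypothetical helpers (not Hadoop commands).
# 'hdfs dfs -test -d <path>' exits with status 0 if the path is a directory.
hdfs_dir_exists() {
    hdfs dfs -test -d "$1"
}

report_dir() {
    if hdfs_dir_exists "$1"; then
        echo "$1 exists"
    else
        echo "$1 is missing"
    fi
}

# Example call (needs a running cluster):
#   report_dir /my_hdfs
```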
3: Copying files from the local RHEL filesystem to HDFS
(base) [admin@MASTER ashish]$ hdfs dfs -copyFromLocal '/home/admin/ashish/big_data.csv' /my_hdfs
2020-05-02 00:04:35,838 INFO sasl.SaslDataTransferClient: SASL encryption trust check: localHostTrusted = false, remoteHostTrusted = false
...
2020-05-02 00:04:39,785 INFO sasl.SaslDataTransferClient: SASL encryption trust check: localHostTrusted = false, remoteHostTrusted = false
(base) [admin@MASTER ashish]$ hdfs dfs -copyFromLocal '/home/admin/ashish/Data_20200311.csv' /my_hdfs
2020-05-02 09:34:18,640 INFO sasl.SaslDataTransferClient: SASL encryption trust check: localHostTrusted = false, remoteHostTrusted = false
4: Listing the files
(base) [admin@MASTER ashish]$ hdfs dfs -ls /my_hdfs
Found 1 items
-rw-r--r-- 1 admin supergroup 974609456 2020-05-02 00:04 /my_hdfs/big_data.csv
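The fifth column of the listing is the file size in bytes. As a small illustration of the column layout, the sketch below converts that column to MiB with awk; the function name `list_sizes_mib` is hypothetical, and it assumes paths without spaces, since it takes the last whitespace-separated field as the path.

```shell
# Hypothetical filter: read 'hdfs dfs -ls' output on stdin and print
# each path with its size in MiB. The "Found N items" header has fewer
# than 8 fields and is skipped.
list_sizes_mib() {
    LC_ALL=C awk 'NF >= 8 { printf "%s %.1f MiB\n", $NF, $5 / 1048576 }'
}

# Example call (needs a running cluster):
#   hdfs dfs -ls /my_hdfs | list_sizes_mib
```

For everyday use, `hdfs dfs -ls -h` prints human-readable sizes directly.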
5: Removing a file
(base) [admin@MASTER ~]$ ls
requirements.txt Videos Documents Pictures
(base) [admin@MASTER ~]$ hdfs dfs -copyFromLocal requirements.txt /my_hdfs
2020-05-02 11:30:39,211 INFO sasl.SaslDataTransferClient: SASL encryption trust check: localHostTrusted = false, remoteHostTrusted = false
(base) [admin@MASTER ~]$ hdfs dfs -ls /my_hdfs
Found 3 items
-rw-r--r-- 1 admin supergroup 5893944 2020-05-02 09:34 /my_hdfs/Data.csv
-rw-r--r-- 1 admin supergroup 974609456 2020-05-02 00:04 /my_hdfs/big_data.csv
-rw-r--r-- 1 admin supergroup 2017 2020-05-02 11:30 /my_hdfs/requirements.txt
(base) [admin@MASTER ~]$ hdfs dfs -rm /my_hdfs/requirements.txt
Deleted /my_hdfs/requirements.txt
(base) [admin@MASTER ~]$ hdfs dfs -ls /my_hdfs
Found 2 items
-rw-r--r-- 1 admin supergroup 5893944 2020-05-02 09:34 /my_hdfs/Data.csv
-rw-r--r-- 1 admin supergroup 974609456 2020-05-02 00:04 /my_hdfs/big_data.csv
6: Exploring the default directories.
(base) [admin@MASTER ~]$ hdfs dfs -ls /
Found 1 items
drwxr-xr-x - admin supergroup 0 2020-05-05 23:01 /user
(base) [admin@MASTER ~]$ hdfs dfs -ls /user
Found 1 items
drwxr-xr-x - admin supergroup 0 2020-05-05 23:01 /user/admin
(base) [admin@MASTER ~]$ hdfs dfs -ls /user/admin
Found 1 items
drwxr-xr-x - admin supergroup 0 2020-05-06 12:12 /user/admin/.sparkStaging
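Instead of descending one level at a time, the whole tree can be listed with `hdfs dfs -ls -R /`. The sketch below filters such a listing down to directories by checking for the leading `d` in the permissions column. The function name `only_dirs` is hypothetical, and paths containing spaces are not handled.

```shell
# Hypothetical filter: read 'hdfs dfs -ls' output on stdin and keep only
# the directory paths (entries whose permission string starts with "d").
only_dirs() {
    awk '$1 ~ /^d/ { print $NF }'
}

# Example call (needs a running cluster):
#   hdfs dfs -ls -R / | only_dirs
```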
Working with HDFS (ls, mkdir, copyFromLocal, rm)