apache · bigdata · gfs · hadoop · hdfs

Introduction to Hadoop (Bigdata)



In this presentation, we are going to introduce the Hadoop Distributed File System, an Apache open source distributed file system designed to run on commodity hardware.

we’ll cover:

– Origins of HDFS and Google File System / GFS
– How a file breaks up into blocks before being distributed to a cluster
– NameNode and DataNode basics
– technical architecture of HDFS
– sample HDFS commands
– Rack Awareness
– Synchrounous write pipeline
– How a client reads a file

from Blogger http://ift.tt/1Bb2dHJ
via IFTTT

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s