Hadoop Introduction

I was browsing around when I chanced upon Hadoop. Now what exactly is Hadoop? Hadoop is a framework written in Java that allows you to write applications that will run in clusters of computers to process a large amount of data. Hadoop can work on single server to thousands of machines. You can even check out database design and database normalization

Below is a picture of Hadoop File System architecture. Hopefully it can give us a clearer picture of what HDFS is all about.

Hadoop File System Architecture

These are some of the Big Data Examples on internet usage as shown below:

Black Box Data : Flight and AirCraft related data.

Social Media Data

Stock Exchange Data

Power Grid Data

Transport Data

Search Engine Data