Let me tell you about BigData
what
is big data basically refers to a huge volume of data that cannot be stored and
processed using the traditional approach within the given the time frame the
next big question that comes to our mind is how huge this data needs to be in
order to be classified as big data there is a lot of misconception while
referring the term Big Data we usually, use the term Big Data to refer to the
data that is either in gigabytes or terabytes or petabytes or exabytes or
anything that is larger than this in size this does not defines the term Big
Data completely even a small amount of data can be referred to as big data
depending on the context it is being used let me take an example and try to
explain it to you for instance if we try to attach a document that is of 100
megabytes in size to an email we would not be able to do so as the email system
would not support an attachment of this size therefore these 100 megabytes of
attachment with respect to email can be referred to as Big Data let me take
another example and try to explain the term Big Data let us say we have around
10 terabytes of image files upon which certain processing needs to be done for
instance we may want to resize and enhance these images within a given time
frame suppose if we make use of the traditional system to perform this task we
would not be able to accomplish this task within the given time frame as the
computing resources of the traditional system would not be efficient to
accomplish this task on time therefore these 10 terabytes of image files can be
referred to as big data now let us try to understand big data using some
real-world examples I believe you all might be aware of some of the popular
social networking sites such as Facebook Twitter LinkedIn Google+ and YouTube
each of this site receive huge volume of data on a daily basis it has been
reported on some of the popular tech blocks that Facebook alone receives around
100 terabytes of data each day whereas Twitter processes around 400 million
tweets each day as far as LinkedIn and Google+ are concerned each of their site
receives tens of terabytes of data on a daily basis and finally coming to
YouTube it has been reported that each minute around 48 hours of flash videos
are uploaded to YouTube you can just imagine how much volume of data is being
stored and processed on these sites but as the number of users keep growing on
these sites storing and processing this data becomes a challenging task since
this data holds a lot of valuable information this data needs to be processed
in a short span of time by using this valuable information companies can boost
their sales and generate more revenue by making use of the traditional
computing system we would not be able to accomplish this computing system would
not be sufficient for processing and storing such a huge volume of data this is
where Hadoop comes into picture we would be discussing Hadoop in more detail in
the later sessions therefore we can term this huge volume of data as big data
let me take another real-world example related to the airline industry and try
to explain the term big data for instance the aircraft's while they're flying
they keep transmitting data to the air traffic control located at the airports
the air traffic control uses this data to track and monitor the status and
progress of the flight on a real-time basis since multiple aircrafts would be
transmitting this data simultaneously a huge volume of data gets accumulated at
the air traffic control within a short span of time therefore, it becomes a
challenging task to manage and process this huge volume of data using the
traditional approach hence, we can turn this huge volume of data as big data
Comments
Post a Comment