HOW MNC’S LIKE GOOGLE, FACEBOOK, INSTAGRAM STORES , MANAGES AND MANIPULATES THOUSANDS OF TERABYTES OF DATA ?
WHAT IS DATA?
Data is distinct piece of information , usually formatted in different way. Data can exist in variety of forms — as numbers or text on pieces of paper, as bits or bytes stored in electronic memory, or as facts stored in a person’s mind.
WHAT IS BIG DATA?
Big Data is also data but with a huge size. Big Data is a term used to describe a collection of data that is huge in volume. In short such data is so large and complex that none of the traditional data management tools are able to store it or process it efficiently.
examples: Here are few examples of various MNC’s .
FACEBOOK:
Facebook has become one of the world’s largest repositories of personal data.Facebook is the largest social networking site in the world, with more than 2.2 billion monthly active users. Facebook has revealed that it is generating around 500+ terabytes of data every day. In which 2.7 billion were likes and around 300 million photos per day. Another exciting thing is Facebook is scanning around 105 terabytes of data per each half hour.
Google:
Google knows everything. Now you must be wondering how much data does google handle to answer all these questions.Google now processes over 40,000 search queries every second on average, which translates to over 3.5 billion searches per day and 1.2 trillion searches per year worldwide.More than 3.7 billion humans now have regular access to and use the internet. That results in about 40,000 web searches per second— on Google alone.
TWITTER:
One wouldn’t think that 140-character messages comprise large stores of data, but it turns out that the Twitter community generates more than 12 terabytes of data per day.
YOUTUBE:
The total number of people who use YouTube are 1,300,000,000.Almost 5 billion videos are watched on Youtube every single day. YouTube gets over 30 million visitors per day. The total number of hours of video watched on YouTube each month — 3.25 billion.How much data YouTube will use depends on the quality of your video playback. Watching a YouTube video at the standard 480p uses around 260MB per hour, while Full HD viewing can chew through 1.65GB. 4K video playback on YouTube will use as much as 2.7GB of data every hour.
SNAPCHAT:
Snapchat currently has 229 million daily active users worldwide.According to the latest Snapchat statistics, over 210 million snaps are created every single day. Snapchat was being downloaded approximately 6000,000 times per day worldwide .
NETFLIX:
Netflix users watch the equivalent of 18,812 years of video on the platform, every single day. This massive video payload gobbles up approximately 15% of the world’s entire bandwidth capacity, according to a recent report. If we assume the average Netflix user uses 3 GB of data per hour, we can multiply 164.8 million hours by 3 to get a whopping 494.4 million GB of bandwidth, each and every day.Over the course of a year, that’s 180.5 billion GB of data consumed by Netflix users, a jaw-dropping amount.
PROBLEMS:
- Volume: volume is one characteristic which needs to be considered while dealing with Big Data. The name Big Data itself is related to a size which is enormous. Also, whether a particular data can actually be considered as a Big Data or not, is dependent upon the volume of data.
- Velocity: The term velocity refers to the speed of generation of data. How fast the data is generated and processed to meet the demands, determines real potential in the data. etc
Management:
Management of such huge is not easy. We do not have storage with this much of volume. So, it becomes very difficult to manage these data .
The concept used to store this huge amount of data is called ‘Distributed Storage’ .
In Distributed storage we store the data in multiple systems over the network. This will solve the problems of volume and velocity.
In this approach what we do is we split data into multiple parts after that we will store that parts into multiple systems.
But this is just an concept to implement this we have to use some products like HADOOP.
Thanks!!