Big Data Interview Questions
What is BIG DATA?
Big Data is nothing but an assortment of such a huge and complex data that it becomes very tedious to capture, store, process, retrieve and analyze it with the help of on-hand database management tools or traditional data processing techniques. There are many definitions and to put it simply, Big data is all about the growing recognition of value generation opportunities from unprecedented amount of information. For a c-level executive it is simply an answer to the question of “What could you do with the massive amounts if information?”.
What are the three characteristics or three ‘V’s of Big Data?
According to IBM, the three characteristics of Big Data are:
Volume: Facebook generating 500+ terabytes of data per day.
Velocity: Analyzing 2 million records each day to identify the reason for losses.
Variety: images, audio, video, sensor data, log files, etc.
Give some examples of Big Data?
Social: Twitter is getting 100K tweets in a second. Facebook is generating 500+ terabytes of data per day
Enterprise: NYSE (New York Stock Exchange) generates about 1 terabyte of new trade data per day
Application: A jet airline collects 10 terabytes of censor data for every 30 minutes of flying time.
All the above are day to day examples of Big Data.
Give a detailed overview of Big Data being generated by Facebook?
As of December 31, 2012, there are 1.06 billion monthly active users on facebook and 680 million mobile users. On an average, 3.2 billion likes and comments are posted every day on Facebook. 72% of web audience is on Facebook. And why not! There are so many activities going on facebook from wall posts, sharing images, videos, writing comments and liking posts, etc. In fact, Facebook started using Hadoop in mid-2009 and was one of the initial users of Hadoop.