Big Data Training In Bangalore
To really understand BigData, it is helpful to have some historical background. Big data is data arriving in increasing volume with greater variety and with ever-higher velocity. This is known as the three Vs.
Big Data is basically data but continuously growing in size due to the nature of the data itself. The volume of data is such that traditional data management tools are unable to store or process this efficiently.
- New trade data generated at The New York Stock Exchange daily -> One terabyte+
- New data, in the form of photo and video uploads and comments, into Facebook daily -> 500+terabytes
- Thousands of flights each day generate each day -> 1+ Petabytes
Structured data is data that can be stored, accessed and processed in a fixed format, with ready-to-use tools.
Example: Data in an employee table in a relational database
Unstructured data is essentially where each component of the data could have different kinds of information. In a structured database, all records are essentially similar in terms of fields. However, In unstructured data, each record may have different sets of fields with different types of values.
Example: Google drive containing a group of text files, images, videos, pdf files, etc.
Semi-structured data is somewhere in between the above two forms, with information being organized in a particular way, but requiring new tools for extracting information from the data that is stored.
Example: XML file containing personnel data.
Big Data Overview