Big Data

Big Data Training In Bangalore

To really understand Big Data, it helps to have some background. Big data is data that arrives in increasing volume, with greater variety, and at ever-higher velocity. These properties are known as the three Vs.

Big Data is still data, but it keeps growing in size because of the way it is generated. The volume is such that traditional data management tools cannot store or process it efficiently. Some examples:


  • New trade data generated at the New York Stock Exchange daily -> 1+ terabytes
  • New data, in the form of photo and video uploads and comments, added to Facebook daily -> 500+ terabytes
  • Data generated by thousands of flights each day -> 1+ petabytes
In simple terms, the continuous generation of large, complex data sets from multiple sources is the essence of Big Data. Earlier methods of data processing do not scale well enough to handle these huge data sets quickly. Big Data also helps uncover useful business knowledge hidden in the data, such as patterns and combinations. This knowledge can be used to solve new business problems more easily and with reliable outcomes.
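
To get a feel for why a single machine struggles with these volumes, here is a rough back-of-the-envelope sketch in Python. The read speed of 200 MB/s is an assumed figure for one disk, chosen only for illustration, and the labels and helper name are hypothetical. The daily volumes are the ones listed above.

```python
# Assumed sequential read speed of a single disk (illustrative, not measured).
ASSUMED_READ_BYTES_PER_SEC = 200 * 10**6  # 200 MB/s

TB = 10**12  # one terabyte in bytes (decimal units)
PB = 10**15  # one petabyte in bytes (decimal units)

# Daily volumes taken from the examples above (lower bounds).
daily_volumes = {
    "Stock exchange trade data": 1 * TB,
    "Social media uploads": 500 * TB,
    "Flight data": 1 * PB,
}


def scan_hours(num_bytes, bytes_per_sec=ASSUMED_READ_BYTES_PER_SEC):
    """Hours needed to read num_bytes once at a constant read speed."""
    return num_bytes / bytes_per_sec / 3600


for label, size in daily_volumes.items():
    print(f"{label}: about {scan_hours(size):,.1f} hours to read once")
```

Under this assumption, merely reading one day of the larger data sets takes one disk far longer than a day, which is why the work is usually spread across many machines.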
Categories of Big Data
  • Structured
    Structured data is data that can be stored, accessed and processed in a fixed format, with ready-to-use tools.
    Example: Data in an employee table in a relational database
  • Unstructured
    Unstructured data has no fixed format, so each item can hold a different kind of information. In a structured database, all records share the same fields. In unstructured data, however, each record may have a different set of fields with different types of values.
    Example: A Google Drive folder containing a mix of text files, images, videos, PDF files, etc.
  • Semi-structured
    Semi-structured data falls between the two forms above. The information is organized in a particular way, but extracting it from the stored data requires new tools.
    Example: XML file containing personnel data.
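
The three categories can be illustrated with a small Python sketch that uses only the standard library. The sample records, names, and field choices below are hypothetical and exist only to show the difference in shape between the categories.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured: every record has the same fixed fields, like a database table.
employee_csv = "id,name,department\n1,Asha,Engineering\n2,Ravi,Sales\n"
structured_rows = list(csv.DictReader(io.StringIO(employee_csv)))

# Semi-structured: tags organize the data, but a record may omit a field.
personnel_xml = """
<personnel>
  <person id="1"><name>Asha</name><email>asha@example.com</email></person>
  <person id="2"><name>Ravi</name></person>
</personnel>
""".strip()
root = ET.fromstring(personnel_xml)
semi_structured = [
    {
        "id": person.get("id"),
        "name": person.findtext("name"),
        "email": person.findtext("email"),  # None when the tag is absent
    }
    for person in root.findall("person")
]

# Unstructured: free text with no schema; any structure must be inferred.
unstructured_note = "Quarterly review went well. Photos and video attached."
word_count = len(unstructured_note.split())

print(structured_rows)
print(semi_structured)
print(word_count)
```

The structured rows can be queried by field name directly, the XML needs a parser and a check for missing fields, and the free text offers nothing beyond what the program infers from it.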
Here at Learn And Shine, we offer multiple courses to help you understand Big Data and develop the ability to solve new problems every day.

Big Data Overview

Hadoop Administration

Apache Spark

Working with Spark And Scala

Hadoop Developer

Big Data Processing with Spark And Python
