Data is crucial in large global organisations, governments, and defence setups. Some of the main focus industries contributing to the world of Data Science come from Education and Research, Telecom, Banking, Pharmaceutical, Consumer Products, Internet and Communication; and that is just the tip of the iceberg.
Organisations worldwide have been experiencing the need to structure the receipt of data into their system, structure the storage, analyse the information and process it to qualified or quantified intelligence. This intelligence can further be utilised for business decisions. Added to this speed, accuracy and volume have to be on the mark.
Data now is generated and consumed from many sources like social media, mobile usage, mobile applications, wearable devices, industrial IOT (IIOT), ERP’s and traditional office utilities.
The importance of data today can be seen via some numbers below:
- Approximately 3.8 billion internet users in 2017.
- Social Media is HUGE – Reports show that social media gains 840 new users each minute
- Since 2013, the number of Tweets each minute has increased 58% to more than 455,000 Tweets PER MINUTE in 2017!
- YouTube usage more than tripled from 2014-2016 with users uploading 400 hours of new video each minute of every day! Now, in 2017, users are watching 4,146,600 videos every minute.
- Instagram users upload 46,740 million posts every minute!
- Since 2013, the number of Facebook Posts shared each minute has increased 22%, from 2.5 Million to 3 million posts per minute in 2016. This number has increased more than 300 percent, from around 650,000 posts per minute in 2011!
- Every minute on Facebook: 510,000 comments are posted, 293,000 statuses are updated, and 136,000 photos are uploaded.
- Facebook users also click the like button on more than 4 million posts every minute!
- 3,607,080 Google searches are conducted worldwide each minute of everyday.
- Worldwide, 15,220,700 texts are sent every minute!
- Instagram users post 46,740 pictures every minute.
What does this translate into a day of global internet, it is approximately:
- 1,209,600 new data producing social media users each day
- 656 million tweets per day!
- More than 4 million hours of content uploaded to YouTube every day, with users watching 5.97 billion hours of YouTube videos each day
- 67,305,600 Instagram posts uploaded each day
- There are over 2 billion monthly active Facebook users, compared to 1.44 billion at the start of 2015 and 1.65 at the start of 2016
- Facebook has 1.32 billion daily active users on average as of June 2017
- 4.3 BILLION Facebook messages posted daily!
- 5.75 BILLION Facebook likes every day
- 22 billion texts sent every day
- 5.2 BILLION daily Google Searches in 2017
One slight of hand could result large scale disruptions in revenues, market capital, branding and loss of consumer share.
This has led to the evolution of modern day:
- BI & reporting solutions like Tableau, Qlikview, SAP HANA etc.,
- Industry specific solutions like SAS
- Programming and statistical tools like Python, R Studio and Spark
- Cloud technologies ranging from Azure, AWS to multiple SAAS & PAAS tools which are open source
- Databases of varied architectures from multiple technology majors like SQL Server, Oracle, NoSQL flavors which include MongoDB, Cassandra and many more
Despite so much focus on these tools, it should be noted that these are only implements used for data analytics. These may continue to be a choice or might be replaced by more evolved and adaptive technologies.
Core focus should and will remain on the science behind this, which stays with the very founding principles and fundamentals concepts around statistics, data modelling, analytical models and computational concepts.