3. 9/16/20163
From the dawn of civilization until
2003, humankind generated five
exabytes of data. Now we produce
five exabytes every two days…and
the pace is accelerating.
Eric Schmidt,
Executive Chairman, Google
6. 9/16/20166
The ‘Datafication’
of our World;
• Activities
• Conversations
• Words
• Voice
• Social Media
• Browser logs
• Photos
• Videos
• Sensors
• Etc.
Volume
Variety
Velocity
Analysing
Big Data:
• Text analytics
• Sentiment analysis
• Face recognition
• Voice analytics
• Movement analytics
• Etc.
Value
12. 9/16/201613
Ambari™: A web-based tool for provisioning, managing, and monitoring Apache
Hadoop clusters which includes support for Hadoop HDFS, Hadoop MapReduce, Hive,
HCatalog, HBase, ZooKeeper, Oozie, Pig and Sqoop.
Hue : a web interface for Hadoop projects, supports many of the more widely used
components of the Hadoop ecosystem. It features file browsers for HDFS and HBase
and a job browser for MapReduce/YARN.
ZooKeeper™: is a service for coordination and synchronization of distributed systems.
Mahout™: A Scalable machine learning and data mining library.
37. 9/16/201646
Tehran
MSTT Data
Warehouse
(18M data per
day)
AVL
6M locations a
day
BluetoothS
ensors
1.2M vehicles a
day
SCATS
log files
2M log a day of
1400 approaches
Speed
Cameras
5M vehicles a day
e-Ticket
4M transactions a
day
Traffic Zone
Cameras
300K vehicles a
day
الگ داده هزاران روزانه تولید
شهرسازی سیستمشهرداری
روزانه تولیدداده میلیون چند
مختلف منابع از ترافیکی
تولیدروزانهصدهاداد میلیونه
CDRوADSLدرمخابرات