2. Topic Areas
1. Motivations for Instagram project
2. Pattern mining trajectories
3. Analytic innovation and exploratory analysis
4. Instagram analytics tools
5. NoSQL- MongoDB
6. Datafication 3 walk thru
7. Q&A
3. Motivations for Instagram Project
• Internet of Things (Sensors and RFID)
• Indoor GPS
• Car parking “anywhere”
• Location based services e.g. advertising
• Tourist recommender system
• Food analytics and traceability (farm fork)
• Mobile apps with trajectory data e.g. Foursquare, Instagram, Nike+ EveryTrial
• Insurance “pay as you drive”– telematics black box based insurance policy
4. Black Box Insurance
• Telematics technology (black box) helps assess the driving
behavior and deliver true driver centric premiums by
capturing:
– Number of journeys
– Distances travelled
– Types of roads
– Speed
– Time of travel
– Acceleration and braking
– Any accidents
• Benefits low mileage, smooth and safe drivers
• Privacy vs. Saving monies on insurance (Canada)
– http://bit.ly/Black_box
5. Pattern Mining Trajectories
Group
of
Trajectories
Trajectory Patterns:
1. Hot regions (basic unit)
2. Trajectory pattern is
relationships amongst regions
Opportunities : Location based networks
Destination prediction
Car-pooling
Personal route planning
Group buying
Loyalty
Credit card data
Adapted from: Chang, Wei, Yeh and Peng, “Discovering Personalised Routes from Trajectories”
ACM, LBSN’11, Chicago,illinois,USA, 1 November 2011
6. Analytic Innovation
“Let’s define analytic innovation as any type of
analytical approach that is new and unique. It is
something a given organization has not done
before, and perhaps something nobody
anywhere has done before…An analytic
innovation should be focused on analyzing a
new data source, solving a new problem…”
Franks, B. (2012) Taming the Big Data Tidal Wave, p. 255, John Wiley & Son
7. Discovery (Exploratory) Analytics
Exploratory
– Unstructured
– Machine learning
– Data mining
– Complex analysis
– Data diversity
Richness
X Business Intelligence
– Dashboard
– Real time decisioning
– Alerts
– Fresh data
– Response time
Speed of Query
8. Instagram Analytics Tools (off the shelf)
• Statigram
– Lifetime likes
– Total comments
– New followers/last 7 days
– Most liked photos
• Simply Measured
– Total engagement Instagram, Facebook and Twitter
– Engaging photo/filter/location
– Top photos by date
– Active commenters
– Best time for engagement
– Best day for engagement
– Top filters
• Nitrogram
– Countries of followers
– Most engaging
– Most commented
– Likes and comments on a photo
9. MongoDB - An Innovation in Databases?
“MongoDB gets the job done”
“document-oriented NoSQL database”
“MongoDB is natural choice when dealing with JSON”
“Same data model in code = same model in database”
“Data structure store to model applications”
“In MongoDB Instagram post can be stored in single collection and stored exactly as represented in the program as one
object. In a relational database an Instagram post would occupy multiple tables.”
“MongoDB understands geo-spatial co-ordinates and supports geo-spatial indexing”
“Initial MongoDB prototype RedHat OpenShift (Public/Private or Community “Platform as a Service”)
Recommendation engine integrating Mahout libraries and MongoDB (see Roadmap)
As discussed @ Journey to MongoDB:Trajectory Pattern Mining in Australian Instagram
By Suresh Sood and Xinhua Zhu
**Sydney MongoDB Meetup 30 April 2013