SlideShare uma empresa Scribd logo
1 de 34
Baixar para ler offline
OpenStack and Monitoring
Unidata S.p.A. case report
Davide Guerri - Unidata S.p.A. - d.guerri@unidata.it
Agenda
• What is Unidata S.p.A.?
• (Cloud) monitoring
• OpenStack Monitoring
• Unidata case report
Unidata S.p.A.
• established in 1985
• pioneer of microcomputer technology in Italy
• today one of the most important ISPs
• PoP at NaMeX, MiX, AMS-IX
• large fiber infrastructure (Rome and province of Rome)
• a large number of WiFi installations (based on the OpenWISP project)
also for the Italian PA
• institutional partners
• AIIP - first Italian ISPs Association - founder and member, 1995
• NaMeX - Internet exchange and interconnection point - founder and
member, 1995
• strong vocation for innovation (making significant investments in R&D)
Unidata S.p.A.
• since 2012 - public and private cloud
services
• UniCloud [3] - yep, it’s OpenStack! ;-)
• Folsom release
• Full access to OpenStack API (SSL)
• IPv6 enabled
Cloud Monitoring
Puppies vs Cattle
Puppies vs Cattle
• (crude) analogy that describes the most appropriate
use of the cloud paradigm
• “The servers in today’s data center are like puppies –
they’ve got names and when they get sick, everything
grinds to a halt while you nurse them back to health” --
Joshua McKenty, co-founder of Piston Cloud
• treat servers like cattle
• a single server should easily replaced
• it should be possible to (seamlessly) increment or
decrement their number for a given application
Puppies vs Cattle
• ...not only for VMs...
• it also make sense for the bare-metal
• this also changes something for
monitoring, doesn’t it?
Cloud Monitoring
• for cloud monitoring we’ve got two points of
view
• operators
• infrastructural monitoring
• end users
• cloud infrastructural resources (IaaS)
monitoring (e.g. cloud servers monitoring)
• cloud services monitoring (SaaS/PaaS)
Cloud Monitoring
• in both cases: what to monitor? and with what
purpose?
• availability - for proactive anomalies fix
• efficiency - for (proactive) capacity planning
• what is needed?
• alerting systems
• instantaneous measures
• historical data
OpenStack Monitoring
OpenStack Monitoring
• as of today (Grizzly release) there is no
integrated and ready-to-use monitoring
system [1]
• what about Ceilometer?
• general purpose measurement
collector
OpenStack Monitoring
• Healthnmon (uses ceilometer) [2]
• inventory management
• alerts and notifications
• utilization data (CPU, RAM, network,
storage) for guests and hosts
...meanwhile...
• those who already offer cloud services based on
the OpenStack had to develop (semi-) ad-hoc
solutions
• OpenStack is massively scalable...
• ...so also the monitoring system should be
scalable
• the good news is that we have all the ingredients
• and they are free and open source ;-)
What to monitor?
• load average/ CPUs/RAM/swap/disk
& network usage
• alerts based on absolute (and relative)
thresholds
• health of storage resources
• logs analysis
• system integrity checks
What to monitor?
• OpenStack specific
• services availability and logs of the following
• nova-*
• glance-*
• cinder-*
• keystone
• horizon
• misc (dnsmasq, swift, rabbitmq)
Unidata S.p.A. case report
UniCloud
• UniCloud logical architecture - public
cloud infrastructure
Monitoring - Operator p.o.v
UniCloud Monitoring
• Zenoss core, for infrastructural monitoring
• open source (GPLv2)
• SNMP and network protocol monitoring of
applications, servers and network devices
• auto-discovery / auto-modeling
• crucial for automatizations (puppies vs
cattle)
• just add the SNMP agent to the configuration
of new nodes (e.g. with Puppet)
UniCloud - Zenoss core
• Web UI with events and
infrastructure summary
• historical data browsing
• customizable reports
• real-time email or user-
defined alerts
• simple integration with an
SMS gateway
UniCloud Monitoring
• OpenStack/Systems logs
• swatch - email alerts for errors/anomalies
• logwatch - daily system status review
• system integrity (and security)
• smartmontools - health of hard drives with email
notifications
• rkhunter - daily systems status analysis and
(eventual) alerting
• arpwatch - real-time ARP monitoring (detection of
duplicate IPs)
Monitoring - User p.o.v
UniCloud Monitoring
• ad hoc monitoring system based on
• OpenStack API
• Collectd [5]
• collects, transfers and stores performance data of
computers and network equipment
• modular architecture
• we used RRD, LibVirt, and network plugins
• free and open source (GPLv2)
• we wrote a patch for the LibVirt plugin -
included since version 5.2 [6]
UniCloud Monitoring
• Front-end
• WEB-UI RoR (written from scratch)
• OpenStack ActiveResource - Ruby
binding for OpenStack API by
Unidata S.p.A. [7]
UniCloud Monitoring
• hypervisors
• acquire “raw” data from LibVirt (localhost)
• sends structured data to the collector
• collector
• receives data from the network
• (efficiently) writes RRD files
• RoR application
• establishes a mapping between OpenStack
cloud instances and RRD files (via API)
• renders performance graphs to fulfill user
requests (instances and timespans)
UniCloud Monitoring
• What gets monitored?
• all the measurements that the collectd LibVirt
plugin makes available
• for each vCPU - utilization rate (%)
• for each network interface - pps, bps and eps
(in+out)
• for each disks - bps and ops (read+write)
• with “extra volumes” from nova-volume
(or cinder)
UniCloud Monitoring
• Does it scale?
• collectd is not a new product...
• it has proven itself to be very reliable and scalable
• it’s possible to use multiple collectors
• for HA (using multicast) or LB
• puppies vs cattle?
• automatic discovery of new cloud instances
• collectd installation and configuration should be made by
means of a configuration management system (e.g. Puppet)
UniCloud Monitoring
Collectd configuration example (/etc/collectd/collectd.conf)
Collector
Hypervisors
Some screenshots
Some screenshots
Some screenshots
Grazie per l’attenzione
Domande!
[1] OpenStack official programs
https://wiki.openstack.org/wiki/Programs
[2] Ceilometer and Healthnmon
https://wiki.openstack.org/wiki/Ceilometer/CeilometerAndHealthnmon
[3] UniCloud
http://unicloud.it
[4] Zenoss
http://zenoss.com
[5] Collectd
http://collectd.org
[6] Collectd 5.2 changelog
https://collectd.org/wiki/index.php/Version_5.2
[7] OpenStack ActiveResource
https://github.com/Unidata-SpA/openstack_activeresource
References

Mais conteúdo relacionado

Mais procurados

The OpenStack Cloud at CERN - OpenStack Nordic
The OpenStack Cloud at CERN - OpenStack NordicThe OpenStack Cloud at CERN - OpenStack Nordic
The OpenStack Cloud at CERN - OpenStack NordicTim Bell
 
20161025 OpenStack at CERN Barcelona
20161025 OpenStack at CERN Barcelona20161025 OpenStack at CERN Barcelona
20161025 OpenStack at CERN BarcelonaTim Bell
 
OpenStack Toronto Q2 MeetUp - June 1st 2017
OpenStack Toronto Q2 MeetUp - June 1st 2017OpenStack Toronto Q2 MeetUp - June 1st 2017
OpenStack Toronto Q2 MeetUp - June 1st 2017Stacy Véronneau
 
OpenStack @ CERN, by Tim Bell
OpenStack @ CERN, by Tim BellOpenStack @ CERN, by Tim Bell
OpenStack @ CERN, by Tim BellAmrita Prasad
 
20150924 rda federation_v1
20150924 rda federation_v120150924 rda federation_v1
20150924 rda federation_v1Tim Bell
 
20170926 cern cloud v4
20170926 cern cloud v420170926 cern cloud v4
20170926 cern cloud v4Tim Bell
 
Testbed and Opportunities. Jerry Horgan, TSSG
Testbed and Opportunities. Jerry Horgan, TSSGTestbed and Opportunities. Jerry Horgan, TSSG
Testbed and Opportunities. Jerry Horgan, TSSGWalton Institute
 
Open Source Operations
Open Source OperationsOpen Source Operations
Open Source OperationsDaniel Paulus
 
OpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic CloudOpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic CloudJakub Pavlik
 
OpenStack Ottawa Q3 Meetup September 26th 2017
OpenStack Ottawa Q3 Meetup   September 26th 2017OpenStack Ottawa Q3 Meetup   September 26th 2017
OpenStack Ottawa Q3 Meetup September 26th 2017Stacy Véronneau
 
RIPE NCC Tools & Measurements
RIPE NCC Tools & MeasurementsRIPE NCC Tools & Measurements
RIPE NCC Tools & MeasurementsRIPE NCC
 
OpenStack Ottawa Q2 MeetUp - May 31st 2017
OpenStack Ottawa Q2 MeetUp - May 31st 2017OpenStack Ottawa Q2 MeetUp - May 31st 2017
OpenStack Ottawa Q2 MeetUp - May 31st 2017Stacy Véronneau
 
Searchlight Updates - Liberty Edition
Searchlight Updates - Liberty EditionSearchlight Updates - Liberty Edition
Searchlight Updates - Liberty EditionOpenStack Foundation
 
Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)
Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)
Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)Kota Tsuyuzaki
 
Durability Simulator Design for OpenStack Swift
Durability Simulator Design for OpenStack SwiftDurability Simulator Design for OpenStack Swift
Durability Simulator Design for OpenStack SwiftKota Tsuyuzaki
 
Containers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStack
Containers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStackContainers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStack
Containers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStackOpenStack
 
Openstack & rackspace – yesterday, today and tomorrow
 Openstack & rackspace – yesterday, today and tomorrow Openstack & rackspace – yesterday, today and tomorrow
Openstack & rackspace – yesterday, today and tomorrowsriram_rajan
 
NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus Hirofumi Ichihara
 

Mais procurados (20)

The OpenStack Cloud at CERN - OpenStack Nordic
The OpenStack Cloud at CERN - OpenStack NordicThe OpenStack Cloud at CERN - OpenStack Nordic
The OpenStack Cloud at CERN - OpenStack Nordic
 
20161025 OpenStack at CERN Barcelona
20161025 OpenStack at CERN Barcelona20161025 OpenStack at CERN Barcelona
20161025 OpenStack at CERN Barcelona
 
OpenStack Toronto Q2 MeetUp - June 1st 2017
OpenStack Toronto Q2 MeetUp - June 1st 2017OpenStack Toronto Q2 MeetUp - June 1st 2017
OpenStack Toronto Q2 MeetUp - June 1st 2017
 
OpenStack @ CERN, by Tim Bell
OpenStack @ CERN, by Tim BellOpenStack @ CERN, by Tim Bell
OpenStack @ CERN, by Tim Bell
 
20150924 rda federation_v1
20150924 rda federation_v120150924 rda federation_v1
20150924 rda federation_v1
 
UNH-IOL USGv6 Profile Update Webinar
UNH-IOL USGv6 Profile Update WebinarUNH-IOL USGv6 Profile Update Webinar
UNH-IOL USGv6 Profile Update Webinar
 
20170926 cern cloud v4
20170926 cern cloud v420170926 cern cloud v4
20170926 cern cloud v4
 
Testbed and Opportunities. Jerry Horgan, TSSG
Testbed and Opportunities. Jerry Horgan, TSSGTestbed and Opportunities. Jerry Horgan, TSSG
Testbed and Opportunities. Jerry Horgan, TSSG
 
Open Source Operations
Open Source OperationsOpen Source Operations
Open Source Operations
 
OpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic CloudOpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic Cloud
 
OpenStack Ottawa Q3 Meetup September 26th 2017
OpenStack Ottawa Q3 Meetup   September 26th 2017OpenStack Ottawa Q3 Meetup   September 26th 2017
OpenStack Ottawa Q3 Meetup September 26th 2017
 
RIPE NCC Tools & Measurements
RIPE NCC Tools & MeasurementsRIPE NCC Tools & Measurements
RIPE NCC Tools & Measurements
 
Contiki OS Research Projects Guidance
Contiki OS Research Projects GuidanceContiki OS Research Projects Guidance
Contiki OS Research Projects Guidance
 
OpenStack Ottawa Q2 MeetUp - May 31st 2017
OpenStack Ottawa Q2 MeetUp - May 31st 2017OpenStack Ottawa Q2 MeetUp - May 31st 2017
OpenStack Ottawa Q2 MeetUp - May 31st 2017
 
Searchlight Updates - Liberty Edition
Searchlight Updates - Liberty EditionSearchlight Updates - Liberty Edition
Searchlight Updates - Liberty Edition
 
Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)
Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)
Global EC Cluster Updates (OpenStack Mitaka Swift Design Summit)
 
Durability Simulator Design for OpenStack Swift
Durability Simulator Design for OpenStack SwiftDurability Simulator Design for OpenStack Swift
Durability Simulator Design for OpenStack Swift
 
Containers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStack
Containers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStackContainers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStack
Containers and OpenStack: Marc Van Hoof, Kumulus: Containers and OpenStack
 
Openstack & rackspace – yesterday, today and tomorrow
 Openstack & rackspace – yesterday, today and tomorrow Openstack & rackspace – yesterday, today and tomorrow
Openstack & rackspace – yesterday, today and tomorrow
 
NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus
 

Semelhante a OpenStack monitoring - Unidata S.p.A. Case Report

OpenStack Workshop - WECode Harvard Conference
OpenStack Workshop - WECode Harvard ConferenceOpenStack Workshop - WECode Harvard Conference
OpenStack Workshop - WECode Harvard ConferenceIccha Sethi
 
2011 Essex Summit: Openstack/Hyper-V clouds
2011 Essex Summit: Openstack/Hyper-V clouds2011 Essex Summit: Openstack/Hyper-V clouds
2011 Essex Summit: Openstack/Hyper-V cloudsppouliot
 
GEO Analytics Canada Overview April 2020
GEO Analytics Canada Overview April 2020GEO Analytics Canada Overview April 2020
GEO Analytics Canada Overview April 2020GEO Analytics Canada
 
Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013Belmiro Moreira
 
The Art of Container Monitoring
The Art of Container MonitoringThe Art of Container Monitoring
The Art of Container MonitoringDerek Chen
 
Swimming upstream: OPNFV Doctor project case study
Swimming upstream: OPNFV Doctor project case studySwimming upstream: OPNFV Doctor project case study
Swimming upstream: OPNFV Doctor project case studyOPNFV
 
Auto id-labs-kaist-research-2014
Auto id-labs-kaist-research-2014Auto id-labs-kaist-research-2014
Auto id-labs-kaist-research-2014Daeyoung Kim
 
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander DibboOpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander DibboOpenNebula Project
 
ThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxGrace Jansen
 
Améliorer OpenStack avec les technologies Intel
Améliorer OpenStack avec les technologies IntelAméliorer OpenStack avec les technologies Intel
Améliorer OpenStack avec les technologies IntelOdinot Stanislas
 
Cloud Architect Alliance #15: Openstack
Cloud Architect Alliance #15: OpenstackCloud Architect Alliance #15: Openstack
Cloud Architect Alliance #15: OpenstackMicrosoft
 
OpenstackOverview.pdf
OpenstackOverview.pdfOpenstackOverview.pdf
OpenstackOverview.pdfKevinBuck30
 
OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...
OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...
OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...Cisco DevNet
 
OpenStack 101 - All Things Open 2015
OpenStack 101 - All Things Open 2015OpenStack 101 - All Things Open 2015
OpenStack 101 - All Things Open 2015Mark Voelker
 
Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...
Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...
Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...Nagios
 
Openstackoverview-DEC2013
Openstackoverview-DEC2013Openstackoverview-DEC2013
Openstackoverview-DEC2013Michael Lessard
 
All about open stack
All about open stackAll about open stack
All about open stackDataCentred
 

Semelhante a OpenStack monitoring - Unidata S.p.A. Case Report (20)

OpenStack Workshop - WECode Harvard Conference
OpenStack Workshop - WECode Harvard ConferenceOpenStack Workshop - WECode Harvard Conference
OpenStack Workshop - WECode Harvard Conference
 
StarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
StarlingX - A Platform for the Distributed Edge | Ildiko VancsaStarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
StarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
 
2011 Essex Summit: Openstack/Hyper-V clouds
2011 Essex Summit: Openstack/Hyper-V clouds2011 Essex Summit: Openstack/Hyper-V clouds
2011 Essex Summit: Openstack/Hyper-V clouds
 
GEO Analytics Canada Overview April 2020
GEO Analytics Canada Overview April 2020GEO Analytics Canada Overview April 2020
GEO Analytics Canada Overview April 2020
 
Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013Deep Dive Into the CERN Cloud Infrastructure - November, 2013
Deep Dive Into the CERN Cloud Infrastructure - November, 2013
 
The Art of Container Monitoring
The Art of Container MonitoringThe Art of Container Monitoring
The Art of Container Monitoring
 
Swimming upstream: OPNFV Doctor project case study
Swimming upstream: OPNFV Doctor project case studySwimming upstream: OPNFV Doctor project case study
Swimming upstream: OPNFV Doctor project case study
 
Opensource tools for OpenStack IAAS
Opensource tools for OpenStack IAASOpensource tools for OpenStack IAAS
Opensource tools for OpenStack IAAS
 
Auto id-labs-kaist-research-2014
Auto id-labs-kaist-research-2014Auto id-labs-kaist-research-2014
Auto id-labs-kaist-research-2014
 
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander DibboOpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
 
ThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptx
 
Améliorer OpenStack avec les technologies Intel
Améliorer OpenStack avec les technologies IntelAméliorer OpenStack avec les technologies Intel
Améliorer OpenStack avec les technologies Intel
 
Cloud Architect Alliance #15: Openstack
Cloud Architect Alliance #15: OpenstackCloud Architect Alliance #15: Openstack
Cloud Architect Alliance #15: Openstack
 
OpenstackOverview.pdf
OpenstackOverview.pdfOpenstackOverview.pdf
OpenstackOverview.pdf
 
OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...
OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...
OpenStack and OpenDaylight, The Evolving Relationship in Cloud Networking: a ...
 
OpenStack 101
OpenStack 101OpenStack 101
OpenStack 101
 
OpenStack 101 - All Things Open 2015
OpenStack 101 - All Things Open 2015OpenStack 101 - All Things Open 2015
OpenStack 101 - All Things Open 2015
 
Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...
Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...
Nagios Conference 2014 - Konstantin Benz - Monitoring Openstack The Relations...
 
Openstackoverview-DEC2013
Openstackoverview-DEC2013Openstackoverview-DEC2013
Openstackoverview-DEC2013
 
All about open stack
All about open stackAll about open stack
All about open stack
 

Mais de Davide Guerri

EuroCloud - Cloud Communities Day - OpenStack
EuroCloud - Cloud Communities Day - OpenStackEuroCloud - Cloud Communities Day - OpenStack
EuroCloud - Cloud Communities Day - OpenStackDavide Guerri
 
OpenStack 3rd bday - OUSG meeting intro
OpenStack 3rd bday - OUSG meeting introOpenStack 3rd bday - OUSG meeting intro
OpenStack 3rd bday - OUSG meeting introDavide Guerri
 
Multicast introduzione
Multicast introduzioneMulticast introduzione
Multicast introduzioneDavide Guerri
 
Caspur wi fi open source garr conf 2011
Caspur wi fi open source   garr conf 2011Caspur wi fi open source   garr conf 2011
Caspur wi fi open source garr conf 2011Davide Guerri
 
OpenStack User Group Italia
OpenStack User Group ItaliaOpenStack User Group Italia
OpenStack User Group ItaliaDavide Guerri
 
Provinciawifi IEEE Networks 2012
Provinciawifi IEEE Networks 2012Provinciawifi IEEE Networks 2012
Provinciawifi IEEE Networks 2012Davide Guerri
 
ProvinciaWiFi Offline Map - App contest openRoma
ProvinciaWiFi Offline Map - App contest openRomaProvinciaWiFi Offline Map - App contest openRoma
ProvinciaWiFi Offline Map - App contest openRomaDavide Guerri
 

Mais de Davide Guerri (7)

EuroCloud - Cloud Communities Day - OpenStack
EuroCloud - Cloud Communities Day - OpenStackEuroCloud - Cloud Communities Day - OpenStack
EuroCloud - Cloud Communities Day - OpenStack
 
OpenStack 3rd bday - OUSG meeting intro
OpenStack 3rd bday - OUSG meeting introOpenStack 3rd bday - OUSG meeting intro
OpenStack 3rd bday - OUSG meeting intro
 
Multicast introduzione
Multicast introduzioneMulticast introduzione
Multicast introduzione
 
Caspur wi fi open source garr conf 2011
Caspur wi fi open source   garr conf 2011Caspur wi fi open source   garr conf 2011
Caspur wi fi open source garr conf 2011
 
OpenStack User Group Italia
OpenStack User Group ItaliaOpenStack User Group Italia
OpenStack User Group Italia
 
Provinciawifi IEEE Networks 2012
Provinciawifi IEEE Networks 2012Provinciawifi IEEE Networks 2012
Provinciawifi IEEE Networks 2012
 
ProvinciaWiFi Offline Map - App contest openRoma
ProvinciaWiFi Offline Map - App contest openRomaProvinciaWiFi Offline Map - App contest openRoma
ProvinciaWiFi Offline Map - App contest openRoma
 

Último

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 

Último (20)

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 

OpenStack monitoring - Unidata S.p.A. Case Report

  • 1. OpenStack and Monitoring Unidata S.p.A. case report Davide Guerri - Unidata S.p.A. - d.guerri@unidata.it
  • 2. Agenda • What is Unidata S.p.A.? • (Cloud) monitoring • OpenStack Monitoring • Unidata case report
  • 3. Unidata S.p.A. • established in 1985 • pioneer of microcomputer technology in Italy • today one of the most important ISPs • PoP at NaMeX, MiX, AMS-IX • large fiber infrastructure (Rome and province of Rome) • a large number of WiFi installations (based on the OpenWISP project) also for the Italian PA • institutional partners • AIIP - first Italian ISPs Association - founder and member, 1995 • NaMeX - Internet exchange and interconnection point - founder and member, 1995 • strong vocation for innovation (making significant investments in R&D)
  • 4. Unidata S.p.A. • since 2012 - public and private cloud services • UniCloud [3] - yep, it’s OpenStack! ;-) • Folsom release • Full access to OpenStack API (SSL) • IPv6 enabled
  • 7. Puppies vs Cattle • (crude) analogy that describes the most appropriate use of the cloud paradigm • “The servers in today’s data center are like puppies – they’ve got names and when they get sick, everything grinds to a halt while you nurse them back to health” -- Joshua McKenty, co-founder of Piston Cloud • treat servers like cattle • a single server should easily replaced • it should be possible to (seamlessly) increment or decrement their number for a given application
  • 8. Puppies vs Cattle • ...not only for VMs... • it also make sense for the bare-metal • this also changes something for monitoring, doesn’t it?
  • 9. Cloud Monitoring • for cloud monitoring we’ve got two points of view • operators • infrastructural monitoring • end users • cloud infrastructural resources (IaaS) monitoring (e.g. cloud servers monitoring) • cloud services monitoring (SaaS/PaaS)
  • 10. Cloud Monitoring • in both cases: what to monitor? and with what purpose? • availability - for proactive anomalies fix • efficiency - for (proactive) capacity planning • what is needed? • alerting systems • instantaneous measures • historical data
  • 12. OpenStack Monitoring • as of today (Grizzly release) there is no integrated and ready-to-use monitoring system [1] • what about Ceilometer? • general purpose measurement collector
  • 13. OpenStack Monitoring • Healthnmon (uses ceilometer) [2] • inventory management • alerts and notifications • utilization data (CPU, RAM, network, storage) for guests and hosts
  • 14. ...meanwhile... • those who already offer cloud services based on the OpenStack had to develop (semi-) ad-hoc solutions • OpenStack is massively scalable... • ...so also the monitoring system should be scalable • the good news is that we have all the ingredients • and they are free and open source ;-)
  • 15. What to monitor? • load average/ CPUs/RAM/swap/disk & network usage • alerts based on absolute (and relative) thresholds • health of storage resources • logs analysis • system integrity checks
  • 16. What to monitor? • OpenStack specific • services availability and logs of the following • nova-* • glance-* • cinder-* • keystone • horizon • misc (dnsmasq, swift, rabbitmq)
  • 18. UniCloud • UniCloud logical architecture - public cloud infrastructure
  • 20. UniCloud Monitoring • Zenoss core, for infrastructural monitoring • open source (GPLv2) • SNMP and network protocol monitoring of applications, servers and network devices • auto-discovery / auto-modeling • crucial for automatizations (puppies vs cattle) • just add the SNMP agent to the configuration of new nodes (e.g. with Puppet)
  • 21. UniCloud - Zenoss core • Web UI with events and infrastructure summary • historical data browsing • customizable reports • real-time email or user- defined alerts • simple integration with an SMS gateway
  • 22. UniCloud Monitoring • OpenStack/Systems logs • swatch - email alerts for errors/anomalies • logwatch - daily system status review • system integrity (and security) • smartmontools - health of hard drives with email notifications • rkhunter - daily systems status analysis and (eventual) alerting • arpwatch - real-time ARP monitoring (detection of duplicate IPs)
  • 24. UniCloud Monitoring • ad hoc monitoring system based on • OpenStack API • Collectd [5] • collects, transfers and stores performance data of computers and network equipment • modular architecture • we used RRD, LibVirt, and network plugins • free and open source (GPLv2) • we wrote a patch for the LibVirt plugin - included since version 5.2 [6]
  • 25. UniCloud Monitoring • Front-end • WEB-UI RoR (written from scratch) • OpenStack ActiveResource - Ruby binding for OpenStack API by Unidata S.p.A. [7]
  • 26. UniCloud Monitoring • hypervisors • acquire “raw” data from LibVirt (localhost) • sends structured data to the collector • collector • receives data from the network • (efficiently) writes RRD files • RoR application • establishes a mapping between OpenStack cloud instances and RRD files (via API) • renders performance graphs to fulfill user requests (instances and timespans)
  • 27. UniCloud Monitoring • What gets monitored? • all the measurements that the collectd LibVirt plugin makes available • for each vCPU - utilization rate (%) • for each network interface - pps, bps and eps (in+out) • for each disks - bps and ops (read+write) • with “extra volumes” from nova-volume (or cinder)
  • 28. UniCloud Monitoring • Does it scale? • collectd is not a new product... • it has proven itself to be very reliable and scalable • it’s possible to use multiple collectors • for HA (using multicast) or LB • puppies vs cattle? • automatic discovery of new cloud instances • collectd installation and configuration should be made by means of a configuration management system (e.g. Puppet)
  • 29. UniCloud Monitoring Collectd configuration example (/etc/collectd/collectd.conf) Collector Hypervisors
  • 34. [1] OpenStack official programs https://wiki.openstack.org/wiki/Programs [2] Ceilometer and Healthnmon https://wiki.openstack.org/wiki/Ceilometer/CeilometerAndHealthnmon [3] UniCloud http://unicloud.it [4] Zenoss http://zenoss.com [5] Collectd http://collectd.org [6] Collectd 5.2 changelog https://collectd.org/wiki/index.php/Version_5.2 [7] OpenStack ActiveResource https://github.com/Unidata-SpA/openstack_activeresource References