Building a Front End for a Sensor Data Cloud

•

0 gostou•532 visualizações

The talk was delivered by Ian Rolewicz at the International Workshop on Cloud for High Performance Computing 2011 (C4HPC'11), co-located with the 2011 International Conference on Computational Science and its Applications (ICCSA 2011) . Publication: http://bit.ly/GRBkC2 Abstract: This document introduces the TimeCloud Front End, a web based interface for the TimeCloud platform that manages large-scale time series in the cloud. While the Back End is built upon scalable, fault tolerant distributed systems as Hadoop and HBase and takes novel approaches for faciliating data analysis over massive time series, the Front End was built as a simple and intuitive interface for viewing the data present in the cloud, both with simple tabular display and the help of various visualizations. In addition, the Front End implements model-based views and data fetch on-demand for reducing the amount of work performed at the Back End.

Tecnologia Economia e finanças

Building a Front End
Interface for a
Sensor Data Cloud
Ian Rolewicz
Semester Project, FALL 2010
Supervised by Hoyoung Jeung, Michele
Catasta & Zoltán Miklós

Introducing TimeCloud

• Platform for massive time-series
management and analysis
• Currently developed at the LSIR

The Front End
• Web-based interface
• Main Goals:
– Display the Data
– Be user-friendly (preferably)
– Reduce the work performed at the Back End
• Implemented in Python using the Django
Framework and the YUI 2 library.
• Visualizations implemented with Protovis

Full Precision vs. Model-Based
• Full Precision
– Real Data
– Whole Data taken from the Back End
– Only display at the Front End
• Model-Based Approximations
– Reconstructed Data from Parameters
– Less Data retrieved from the Back End
– Reconstruction and display of the values at
the Front End

The Data Model

• NULLs not stored in HBase → better for sparse
data
• Column families stored in separate files

Performance Measures
• Testbed on a cluster of 13 Amazon EC2
servers, each having:
– 15 GB Memory
– 8 EC2 Computing Units
– 1.7 TB Storage
– 64-bit platform
• One of them: HBase Master + Front End
• 12 others: HBase Region Servers

Data Used for Measures

• « Worst-case » for TimeCloud
• Compress no more than 1/5 of original
data when linearly approximated
• Linear regression → in GSN, usually 99%
of compression

Random Reads

• 1000 random reads in approximated
dataset
• Evenly spread
• 22% improvement in query execution time
• Less data retrieved → more cache hits

Network usage
KB transferred KB transferred
Graph #
(original) (approximated)
1 112.3 23.3
2 124.5 28.0
3 126.6 25.9
4 120.2 25.1
5 119.9 26.8
6 124.4 27.7

Conclusion
• Goals achieved:
– Display the Data
– Keep it simple
– Reduce the work performed at the Back End
• Good Basis for future extensions
• Future Work
– User/Group-based managment and access
– Completion of the model-based views
– Design of additional visualizations

Building a Front End for a Sensor Data Cloud

Mais conteúdo relacionado

Mais procurados

Cloud infrastructure on Apache Mesos

Ahmed Bacha

Simulating Heterogeneous Resources in CloudLightning

CloudLightning

Solving Your Backup Needs Using MongoDB Ops Manager, Cloud Manager and Atlas

MongoDB

The Google BigQuery Story: Optimizing 25PB Storage

Ivan Kosianenko

Why would I store my data in more than one database?

Kurtosys Systems

MongoDB.local Austin 2018: Solving Your Backup Needs Using MongoDB Ops Manage...

MongoDB

Apache Hadoop India Summit 2011 talk "The Next Generation of Hadoop MapReduce...

Yahoo Developer Network

MapReduce - Hadoop - Big Data

Nafiz Ishtiaque Ahmed

Surge 2013: Maximizing Scalability, Resiliency, and Engineering Velocity in t...

Coburn Watson

Project Progress

sunnysomchok

SQL Server Reporting Services (SSRS) is an easy-to-use tool for automating reports and creating highly visual dashboards. Although SSRS is easy to learn there are many tips and tricks that can improve your report building experience, not to mention make your reports run blazing fast! This rapid-fire session goes over my learnings from the past six years of developing high-performance SSRS reports, including topics like multivalue parameter efficiencies, how to best utilize subreports, and performing SQL CRUD operations with SSRS. Each rapid-fire topic includes sample data and an SSRS reporting example that users will be able to try out for themselves.

High Performance SSRS

Bert Wagner

goto; London: Keeping your Cloud Footprint in Check

Coburn Watson

ETL with Clustered Columnstore - PASS Summit 2014

Niko Neugebauer

Modeling heterogeneous virtual machines on iaa s data centers

ieeepondy

XRM: An Event-based Resource Management Framework for XCP

Pradeep Padala

Psdot 1 optimization of resource provisioning cost in cloud computing

ZTech Proje

AWS Customer Presentation - JovianDATA

Amazon Web Services

Jelastic Overview

Jelastic Multi-Cloud PaaS

Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja

ResellerClub

Mais procurados (19)

Cloud infrastructure on Apache Mesos

Simulating Heterogeneous Resources in CloudLightning

Solving Your Backup Needs Using MongoDB Ops Manager, Cloud Manager and Atlas

The Google BigQuery Story: Optimizing 25PB Storage

Why would I store my data in more than one database?

MongoDB.local Austin 2018: Solving Your Backup Needs Using MongoDB Ops Manage...

Apache Hadoop India Summit 2011 talk "The Next Generation of Hadoop MapReduce...

MapReduce - Hadoop - Big Data

Surge 2013: Maximizing Scalability, Resiliency, and Engineering Velocity in t...

Project Progress

High Performance SSRS

goto; London: Keeping your Cloud Footprint in Check

ETL with Clustered Columnstore - PASS Summit 2014

Modeling heterogeneous virtual machines on iaa s data centers

XRM: An Event-based Resource Management Framework for XCP

Psdot 1 optimization of resource provisioning cost in cloud computing

AWS Customer Presentation - JovianDATA

Jelastic Overview

Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja

Semelhante a Building a Front End for a Sensor Data Cloud

سکوهای ابری و مدل های برنامه نویسی در ابر

datastack

While cloud computing offers virtually unlimited capacity, harnessing that capacity in an efficient, cost effective fashion can be cumbersome and difficult at the workload level. At the organizational level, it can quickly become chaos. You must make choices around cloud deployment, and these choices could have a long-lasting impact on your organization. It is important to understand your options and avoid incomplete, complicated, locked-in scenarios. Data management and placement challenges make having the ability to automate workflows and processes across multiple clouds a requirement. In this webinar, you will: • Learn how to leverage cloud services as part of an overall computation approach • Understand data management in a cloud-based world • Hear what options you have to orchestrate HPC in the cloud • Learn how cloud orchestration works to automate and align computing with specific goals and objectives • See an example of an orchestrated HPC workload using on-premises data From computational research to financial back testing, and research simulations to IoT processing frameworks, decisions made now will not only impact future manageability, but also your sanity.

Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind

Avere Systems

Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau

Sam Palani

Windows Azure introduction

Microsoft Iceland

Visualizing big data in the browser using spark

Databricks

Java scalability considerations yogesh deshpande

IndicThreads

Cloud

Damilola Mosaku

Taking Splunk to the Next Level - Architecture Breakout Session

Splunk

Mobile+Cloud: a viable replacement for desktop cheminformatics?

Alex Clark

advance computing and big adata analytic.pptx

TeddyIswahyudi1

In this talk, Eitan Suez explores the question: Where does Geode fit in an organization's system architecture? Geode is a unique and feature-rich product that perhaps hasn't seen as much adoption as it deserves. Today's apps are no longer the straightforward, database-backed web applications we used to build a few years ago. Applications have become more sophisticated, as they've had to meet the need to scale, to be reliable, fault-tolerant, and to integrate with other systems. In this talk, Eitan will suggest one particular fit for Geode in the context of a CQRS architecture, and welcomes you to attend, and to contribute by sharing how you've put Geode to use in your organization.

#GeodeSummit - Where Does Geode Fit in Modern System Architectures

PivotalOpenSourceHub

Learn why Snowflake analytic data warehouse makes sense for BI including data loading flexibility and scalability, consumption-based storage and compute costs, Time Travel and data sharing features, support across a range of BI tools like Power BI and Tableau and ability to allocate compute costs. View this on-demand webinar: https://senturus.com/resources/10-reasons-snowflake-is-great-for-analytics/. Senturus offers a full spectrum of services in business intelligence and training on Cognos, Tableau and Power BI. Our resource library has hundreds of free live and recorded webinars, blog posts, demos and unbiased product reviews available on our website at: http://www.senturus.com/senturus-resources/.

10 Reasons Snowflake Is Great for Analytics

Senturus

IBM - Introduction to Cloudant

Francisco González Jiménez

0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2

Raul Chong

Gcp dataflow

Igor Roiter

2014 09-12 lambda-architecture-at-indix

Yu Ishikawa

While cost is a primary "c" driving the adoption of object-based cloud solutions in the life sciences, compute, capacity, and collaboration may all be bigger incentives. In this webinar, we'll examine how to use an Avere Hybrid Cloud NAS infrastructure to gain big benefits in areas like genomics research, personalized medicine, drug discovery, imaging, and other data analysis applications. • Compute - Building production environments in the compute cloud without rewriting existing applications • Capacity - Modernizing storage archives and disaster recovery by adding object storage for durability while leveraging existing on-premises NAS • Collaboration - Using the cloud t o safely and securely share data globally • Cost - Using cloud to lower overall costs to keep pace with fast-growing demands of research initiatives

4 C’s for Using Cloud to Support Scientific Research

Avere Systems

Distributed Computing with Apache Hadoop: Technology Overview

Konstantin V. Shvachko

Paradigm shift in IBM's OLAP solutions and look deeply at IBM Cognos 10.2 Dynamic Cubes. View the webinar video recording and download this deck: http://www.senturus.com/resources/dynamic-cubesin-cognos-10-2-jan/. This webinar included discussions and demonstrations of IBM Cognos 10.2 Cube Designer infrastructure requirements and deployment proven practices, Dynamic Cube Designer, and the world of OLAP in 2013. Senturus, a business analytics consulting firm, has a resource library with hundreds of free recorded webinars, trainings, demos and unbiased product reviews. Take a look and share them with your colleagues and friends: http://www.senturus.com/resources/.

IBM Cognos 10.2 Dynamic Cubes Deeper Dive

Senturus

Scalability Design Principles - Internal Session

Sachin Sancheti - Microsoft Azure Architect

Semelhante a Building a Front End for a Sensor Data Cloud (20)

سکوهای ابری و مدل های برنامه نویسی در ابر

Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind

Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau

Windows Azure introduction

Visualizing big data in the browser using spark

Java scalability considerations yogesh deshpande

Cloud

Taking Splunk to the Next Level - Architecture Breakout Session

Mobile+Cloud: a viable replacement for desktop cheminformatics?

advance computing and big adata analytic.pptx

#GeodeSummit - Where Does Geode Fit in Modern System Architectures

10 Reasons Snowflake Is Great for Analytics

IBM - Introduction to Cloudant

0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2

Gcp dataflow

2014 09-12 lambda-architecture-at-indix

4 C’s for Using Cloud to Support Scientific Research

Distributed Computing with Apache Hadoop: Technology Overview

IBM Cognos 10.2 Dynamic Cubes Deeper Dive

Scalability Design Principles - Internal Session

Mais de PlanetData Network of Excellence

Dl2014 slides

PlanetData Network of Excellence

A Contextualized Knowledge Repository for Open Data about Trentino

PlanetData Network of Excellence

On Leveraging Crowdsourcing Techniques for Schema Matching Networks

PlanetData Network of Excellence

Towards Enabling Probabilistic Databases for Participatory Sensing

PlanetData Network of Excellence

Privacy-Preserving Schema Reuse

PlanetData Network of Excellence

Pay-as-you-go Reconciliation in Schema Matching Networks

PlanetData Network of Excellence

Demo: tablet-based visualisation of transport data in Madrid using SPARQLstream

PlanetData Network of Excellence

On the need for a W3C community group on RDF Stream Processing

PlanetData Network of Excellence

Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...

PlanetData Network of Excellence

Linking Smart Cities Datasets with Human Computation: the case of UrbanMatch

PlanetData Network of Excellence

SciQL, Bridging the Gap between Science and Relational DBMS

PlanetData Network of Excellence

CLODA: A Crowdsourced Linked Open Data Architecture

PlanetData Network of Excellence

Scalable Nonmonotonic Reasoning over RDF Data Using MapReduce

PlanetData Network of Excellence

Data and Knowledge Evolution

PlanetData Network of Excellence

The presentation was delivered by FORTH at the 3rd International Workshop on the role of Semantic Web in Provenance Management 2012 (SWPM2012) in Heraklion, Greece on 28th of May 2012. Abstract: Workflow systems can produce very large amounts of provenance information. In this paper we introduce provenance-based inference rules as a means to reduce the amount of provenance information that has to be stored, and to ease quality control (e.g., corrections). We motivate this kind of (provenance) inference and identify a number of basic inference rules over a conceptual model appropriate for representing provenance. The proposed inference rules concern the interplay between (i) actors and carried out activities, (ii) activities and devices that were used for such activities, and, (iii) the presence of information objects and physical things at events. However, since a knowledge base is not static but it changes over time for various reasons, we also study how we can satisfy change requests while supporting and respecting the aforementioned inference rules. Towards this end, we elaborate on the specification of the required change operations.

Evolution of Workflow Provenance Information in the Presence of Custom Infere...

PlanetData Network of Excellence

This paper was presented by Vassilis Papakonstantinou at the 17th ACM Symposium on Access Control Models and Technologies (ACM SACMAT 2012) in Newark, USA, June 20 - 22, 2012. Abstract: The Resource Description Framework (RDF) has become the defacto standard for representing information in the Semantic Web. Given the increasing amount of sensitive RDF data available on the Web, it becomes increasingly critical to guarantee secure access to this content. In this paper we advocate the use of an abstract access control model to ensure the selective exposure of RDF information. The model is defined by a set of abstract operators. Tokens are used to label RDF triples with access information. Abstract operators model RDF Schema inference rules and propagation of labels along the RDF Schema (RDFS) class and property hierarchies. In this way, the access label of a triple is a complex expression that involves the labels of the triples and the operators applied to obtain said label. Different applications can then adopt different concrete access policies that encode an assignment of the abstract tokens and operators to concrete (specific) values. Following this approach, changes in the interpretation of abstract tokens and operators can be easily implemented resulting in a very flexible mechanism that allows one to easily experiment with different concrete access policies (defined per context or user). To demonstrate the feasibility of the approach, we implemented our ideas on top of the MonetDB and PostgreSQL open source database systems. We conducted an initial set of experiments which showed that the overhead for using abstract expressions is roughly linear to the number of triples considered; performance is also affected by the characteristics of the dataset, such as the size and depth of class and property hierarchies as well as the considered concrete policy.

Access Control for RDF graphs using Abstract Models

PlanetData Network of Excellence

Arrays in Databases, the next frontier?

PlanetData Network of Excellence

This talk was given by FORTH, Greece, at the European Data Forum (EDF) 2012 took place on June 6-7, 2012 in Copenhagen (Denmark) at the Copenhagen Business School (CBS). Abstract: Given the increasing amount of sensitive RDF data available on the Web, it becomes increasingly critical to guarantee secure access to this content. Access control is complicated when RDFS inference rules and other dependencies between access permissions of triples need to be considered; this is necessary, e.g., when we want to associate the access permissions of inferred triples with the ones that implied it. In this paper we advocate the use of abstract provenance models that are defined by means of abstract tokens operators to support fine grained access control for RDF graphs. The access label of a triple is a complex expression that encodes how said label was produced (i.e., the triples that contributed to its computation). This feature allows us to know exactly the effects of any possible change, thereby avoiding a complete recomputation of the labels when a change occurs. In addition, the same application can choose to enforce different access control policies or, different applications can enforce different policies on the same data, avoiding the recomputation of the label of a triple. Preliminary experiments have shown the applicability and benefits of our approach.

Abstract Access Control Model for Dynamic RDF Datasets

PlanetData Network of Excellence

This talk has been given at the 13th International Conference on Principles of Knowledge Representation and Reasoning (KR 2012) to be held in Rome, Italy, June 10-14, 2012 by Ilias Tahmazidis (FORTH). Abstract: We are witnessing an explosion of available data from the Web, government authorities, scientific databases, sensors and more. Such datasets could benefit from the introduction of rule sets encoding commonly accepted rules or facts, application- or domain-specific rules, commonsense knowledge etc. This raises the question of whether, how, and to what extent knowledge representation methods are capable of handling the vast amounts of data for these applications. In this paper, we consider nonmonotonic reasoning, which has traditionally focused on rich knowledge structures. In particular, we consider defeasible logic, and analyze how parallelization, using the MapReduce framework, can be used to reason with defeasible rules over huge data sets. Our experimental results demonstrate that defeasible reasoning with billions of data is performant, and has the potential to scale to trillions of facts.

Towards Parallel Nonmonotonic Reasoning with Billions of Facts

PlanetData Network of Excellence

The presentation was delivered during the 1st International Conference on Health Information Science (HIS 2012) on April 9th, 2012 in Beijing, China. Abstract: In cytomics bookkeeping of the data generated during lab experiments is crucial. The current approach in cytomics is to conduct High-Throughput Screening (HTS) experiments so that cells can be tested under many different experimental conditions. Given the large amount of different conditions and the readout of the conditions through images, it is clear that the HTS approach requires a proper data management system to reduce the time needed for experiments and the chance of man-made errors. As different types of data exist, the experimental conditions need to be linked to the images produced by the HTS experiments with their metadata and the results of further analysis. Moreover, HTS experiments never stand by themselves, as more experiments are lined up, the amount of data and computations needed to analyze these increases rapidly. To that end cytomic experiments call for automated and systematic solutions that provide convenient and robust features for scientists to manage and analyze their data. In this paper, we propose a platform for managing and analyzing HTS images resulting from cytomics screens taking the automated HTS workflow as a starting point. This platform seamlessly integrates the whole HTS workflow into a single system. The platform relies on a modern relational database system to store user data and process user requests, while providing a convenient web interface to end-users. By implementing this platform, the overall workload of HTS experiments, from experiment design to data analysis, is reduced significantly. Additionally, the platform provides the potential for data integration to accomplish genotype-to-phenotype modeling studies.

Automation in Cytomics: A Modern RDBMS Based Platform for Image Analysis and ...

PlanetData Network of Excellence

Mais de PlanetData Network of Excellence (20)

Dl2014 slides

A Contextualized Knowledge Repository for Open Data about Trentino

On Leveraging Crowdsourcing Techniques for Schema Matching Networks

Towards Enabling Probabilistic Databases for Participatory Sensing

Privacy-Preserving Schema Reuse

Pay-as-you-go Reconciliation in Schema Matching Networks

Demo: tablet-based visualisation of transport data in Madrid using SPARQLstream

On the need for a W3C community group on RDF Stream Processing

Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...

Linking Smart Cities Datasets with Human Computation: the case of UrbanMatch

SciQL, Bridging the Gap between Science and Relational DBMS

CLODA: A Crowdsourced Linked Open Data Architecture

Scalable Nonmonotonic Reasoning over RDF Data Using MapReduce

Data and Knowledge Evolution

Evolution of Workflow Provenance Information in the Presence of Custom Infere...

Access Control for RDF graphs using Abstract Models

Arrays in Databases, the next frontier?

Abstract Access Control Model for Dynamic RDF Datasets

Towards Parallel Nonmonotonic Reasoning with Billions of Facts

Automation in Cytomics: A Modern RDBMS Based Platform for Image Analysis and ...

Último

Partners Life - Insurer Innovation Award 2024

The Digital Insurer

Axa Assurance Maroc - Insurer Innovation Award 2024

The Digital Insurer

This presentations targets students or working professionals. You may know Google for search, YouTube, Android, Chrome, and Gmail, but did you know Google has many developer tools, platforms & APIs? This comprehensive yet still high-level overview outlines the most impactful tools for where to run your code, store & analyze your data. It will also inspire you as to what's possible. This talk is 50 minutes in length.

Powerful Google developer tools for immediate impact! (2023-24 C)

wesley chun

Tata AIG General Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

As privacy and data protection regulations evolve rapidly, organizations operating in multiple jurisdictions face mounting challenges to ensure compliance and safeguard customer data. With state-specific privacy laws coming up in multiple states this year, it is essential to understand what their unique data protection regulations will require clearly. How will data privacy evolve in the US in 2024? How to stay compliant? Our panellists will guide you through the intricacies of these states' specific data privacy laws, clarifying complex legal frameworks and compliance requirements. This webinar will review: - The essential aspects of each state's privacy landscape and the latest updates - Common compliance challenges faced by organizations operating in multiple states and best practices to achieve regulatory adherence - Valuable insights into potential changes to existing regulations and prepare your organization for the evolving landscape

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

TrustArc

[2024]Digital Global Overview Report 2024 Meltwater.pdf

hans926745

Tech Trends Report 2024 Future Today Institute.pdf

hans926745

A Domino Admins Adventures (Engage 2024)

Gabriella Davis

Enterprise Knowledge’s Urmi Majumder, Principal Data Architecture Consultant, and Fernando Aguilar Islas, Senior Data Science Consultant, presented "Driving Behavioral Change for Information Management through Data-Driven Green Strategy" on March 27, 2024 at Enterprise Data World (EDW) in Orlando, Florida. In this presentation, Urmi and Fernando discussed a case study describing how the information management division in a large supply chain organization drove user behavior change through awareness of the carbon footprint of their duplicated and near-duplicated content, identified via advanced data analytics. Check out their presentation to gain valuable perspectives on utilizing data-driven strategies to influence positive behavioral shifts and support sustainability initiatives within your organization. In this session, participants gained answers to the following questions: - What is a Green Information Management (IM) Strategy, and why should you have one? - How can Artificial Intelligence (AI) and Machine Learning (ML) support your Green IM Strategy through content deduplication? - How can an organization use insights into their data to influence employee behavior for IM? - How can you reap additional benefits from content reduction that go beyond Green IM?

Driving Behavioral Change for Information Management through Data-Driven Gree...

Enterprise Knowledge

Scaling API-first – The story of a global engineering organization

Radu Cotescu

What are drone anti-jamming systems? The drone anti-jamming systems and anti-spoof technology protect against interference, jamming, and spoofing of the UAVs. To protect their security, countries are beginning to research drone anti-jamming systems, also known as drone strike weapons. The anti-jam and anti-spoof technology protects against interference, jamming and spoofing. A drone strike weapon is a drone attack weapon that can attack and destroy enemy drones. So what is so unique about this amazing system?

What Are The Drone Anti-jamming Systems Technology?

Antenna Manufacturer Coco

Imagine a world where information flows as swiftly as thought itself, making decision-making as fluid as the data driving it. Every moment is critical, and the right tools can significantly boost your organization’s performance. The power of real-time data automation through FME can turn this vision into reality. Aimed at professionals eager to leverage real-time data for enhanced decision-making and efficiency, this webinar will cover the essentials of real-time data and its significance. We’ll explore: FME’s role in real-time event processing, from data intake and analysis to transformation and reporting An overview of leveraging streams vs. automations FME’s impact across various industries highlighted by real-life case studies Live demonstrations on setting up FME workflows for real-time data Practical advice on getting started, best practices, and tips for effective implementation Join us to enhance your skills in real-time data automation with FME, and take your operational capabilities to the next level.

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Safe Software

Building Digital Trust in a Digital Economy Veronica Tan, Director - Cyber Security Agency of Singapore Apidays Singapore 2024: Connecting Customers, Business and Technology (April 17 & 18, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

apidays

Three things you will take away from the session: • How to run an effective tenant-to-tenant migration • Best practices for before, during, and after migration • Tips for using migration as a springboard to prepare for Copilot in Microsoft 365 Main ideas: Migration Overview: The presentation covers the current reality of cross-tenant migrations, the triggers, phases, best practices, and benefits of a successful tenant migration Considerations: When considering a migration, it is important to consider the migration scope, performance, customization, flexibility, user-friendly interface, automation, monitoring, support, training, scalability, data integrity, data security, cost, and licensing structure Next Wave: The next wave of change includes the launch of Copilot, which requires businesses to be prepared for upcoming changes related to Copilot and the cloud, and to consolidate data and tighten governance ShareGate: ShareGate can help with pre-migration analysis, configurable migration tool, and automated, end-user driven collaborative governance

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

sammart93

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Product Anonymous

Boost Fertility New Invention Ups Success Rates.pdf

sudhanshuwaghmare1

AWS Community Day CPH - Three problems of Terraform

Andrey Devyatkin

Real Time Object Detection Using Open CV

Khem

Data Cloud, More than a CDP by Matt Robison

Anna Loughnan Colquhoun

Building a Front End for a Sensor Data Cloud

1. Building a Front End Interface for a Sensor Data Cloud Ian Rolewicz Semester Project, FALL 2010 Supervised by Hoyoung Jeung, Michele Catasta & Zoltán Miklós

2. Introducing TimeCloud • Platform for massive time-series management and analysis • Currently developed at the LSIR

3. TimeCloud System Overview

4. My job

5. The Front End • Web-based interface • Main Goals: – Display the Data – Be user-friendly (preferably) – Reduce the work performed at the Back End • Implemented in Python using the Django Framework and the YUI 2 library. • Visualizations implemented with Protovis

6. TimeCloud Front End Live Demo

7. Full Precision vs. Model-Based • Full Precision – Real Data – Whole Data taken from the Back End – Only display at the Front End • Model-Based Approximations – Reconstructed Data from Parameters – Less Data retrieved from the Back End – Reconstruction and display of the values at the Front End

8. The Data Model • NULLs not stored in HBase → better for sparse data • Column families stored in separate files

9. Performance Measures • Testbed on a cluster of 13 Amazon EC2 servers, each having: – 15 GB Memory – 8 EC2 Computing Units – 1.7 TB Storage – 64-bit platform • One of them: HBase Master + Front End • 12 others: HBase Region Servers

10. Data Used for Measures • « Worst-case » for TimeCloud • Compress no more than 1/5 of original data when linearly approximated • Linear regression → in GSN, usually 99% of compression

11. Random Reads • 1000 random reads in approximated dataset • Evenly spread • 22% improvement in query execution time • Less data retrieved → more cache hits

12. Scan

13. Network usage KB transferred KB transferred Graph # (original) (approximated) 1 112.3 23.3 2 124.5 28.0 3 126.6 25.9 4 120.2 25.1 5 119.9 26.8 6 124.4 27.7

14. Conclusion • Goals achieved: – Display the Data – Keep it simple – Reduce the work performed at the Back End • Good Basis for future extensions • Future Work – User/Group-based managment and access – Completion of the model-based views – Design of additional visualizations

15. Questions ?

Building a Front End for a Sensor Data Cloud

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (19)

Semelhante a Building a Front End for a Sensor Data Cloud

Semelhante a Building a Front End for a Sensor Data Cloud (20)

Mais de PlanetData Network of Excellence

Mais de PlanetData Network of Excellence (20)

Último

Último (20)

Building a Front End for a Sensor Data Cloud