Creating a Data-Driven Government: Big Data With Purpose

The U.S. Department of Commerce collects, processes and disseminates data on a range of issues that impact our nation. Whether it's data on the economy, the environment, or technology, data is critical in fulfilling the Department's mission of creating the conditions for economic growth and opportunity. It is this data that provides insight, drives innovation, and transforms our lives. The U.S. Department of Commerce has become known as "America's Data Agency" because of its tens of thousands of datasets, including satellite imagery, material standards and demographic surveys. But having a host of data and ensuring that this data is open and accessible to all are two separate issues. The latter, expanding open data access, is now a key pillar of the Commerce Department's mission. It was this focus on enhancing open data that led to the creation of the Commerce Data Service (CDS). The mission of the Commerce Data Service is to enable more people to use big data from across the department in innovative ways and across multiple fields. In this talk, I will explore how we are using big data to create a data-driven government. This talk is a keynote given at Texas Tech University's Big Data Symposium.

Creating a Data-Driven Government
Big Data With Purpose
Dr Tyrone W A Grandison
Deputy Chief Data Officer

<< Log(radiances) >>
The US as a histogram

dim light | average light | intense light
radiance roughly proxies for people activity

commercedataservice.github.io/tutorial_viirs_part1
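
As a rough illustration of the histogram idea on this slide, the sketch below bins log-transformed radiance values into the three bands named above. The sample radiances and the cut points between bands are invented for illustration; they are not taken from the VIIRS tutorial.

```python
import math
from collections import Counter

# Hypothetical radiance readings for a handful of pixels (not real VIIRS data).
radiances = [0.2, 0.5, 1.0, 3.0, 8.0, 20.0, 55.0, 150.0]

# Hypothetical cut points on the log10 scale separating the three bands.
DIM_MAX = 0.0      # log10(radiance) below 0 counts as "dim light"
AVERAGE_MAX = 1.0  # log10(radiance) below 1 counts as "average light"


def light_band(radiance):
    """Label one radiance value by the band its log10 falls into."""
    log_r = math.log10(radiance)
    if log_r < DIM_MAX:
        return "dim light"
    if log_r < AVERAGE_MAX:
        return "average light"
    return "intense light"


def log_radiance_histogram(values):
    """Count how many radiance values fall into each band."""
    return Counter(light_band(v) for v in values)


histogram = log_radiance_histogram(radiances)
for band in ("dim light", "average light", "intense light"):
    # Print a one-line text bar per band.
    print(f"{band:>14}: {'#' * histogram[band]}")
```

A real analysis would use many more bins and actual satellite pixels; the three-band version only mirrors the labels on the slide.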

Matrix of histograms

Two-histogram comparison: New York City vs. Las Vegas

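$$
\begin{bmatrix} Y(\text{Labor Force}_i) \\ \vdots \end{bmatrix}
=
\begin{bmatrix} X(\text{Radiance}_{i,j}, \ldots, \text{Radiance}_{i,n}) \\ \vdots \end{bmatrix}
$$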
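
The slide relates labor force (Y) to radiance features (X). A minimal sketch of that idea follows, assuming a single radiance feature per area and an ordinary least-squares line; the slide itself shows several radiance features and does not name the model used, so both the one-feature simplification and the data values are assumptions.

```python
# Hypothetical per-area data: one summary radiance feature and labor force size.
# The numbers are invented so the fit is easy to check by hand.
total_log_radiance = [1.0, 2.0, 3.0, 4.0]
labor_force = [2100.0, 4100.0, 6100.0, 8100.0]


def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


intercept, slope = fit_line(total_log_radiance, labor_force)

# Predict labor force for a hypothetical area with a radiance feature of 5.0.
predicted = intercept + slope * 5.0
print(f"labor force = {intercept:.0f} + {slope:.0f} * radiance; prediction: {predicted:.0f}")
```

With several radiance features per area, the same least-squares idea extends to a multi-column X, which is closer to what the slide depicts.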

Illegal fishing
gas flares
population
?!?!?!

growth and opportunity
69,583 datasets ~ 35.9%

Government takes on the hardest, inelastic problems

“What’s your stack?”
“How fast is your GPU cluster at traversing the graph?”
“Are you a Spark guy?”

In government, there’s a lot more to algorithmic accuracy than a score.
TPR
AUC
F-1
Prec.
MSE
MAPE
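
For readers less familiar with the abbreviations above, here is a small sketch that computes each score from scratch using their standard definitions. The labels, scores and forecasts are made-up toy values, and the 0.5 decision threshold is an assumption.

```python
def confusion_counts(y_true, y_pred):
    """Return (true positives, false positives, false negatives) for 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, fn


def true_positive_rate(y_true, y_pred):
    """TPR (recall): share of actual positives that were predicted positive."""
    tp, _, fn = confusion_counts(y_true, y_pred)
    return tp / (tp + fn)


def precision(y_true, y_pred):
    """Prec.: share of predicted positives that are actual positives."""
    tp, fp, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fp)


def f1_score(y_true, y_pred):
    """F-1: harmonic mean of precision and recall."""
    p = precision(y_true, y_pred)
    r = true_positive_rate(y_true, y_pred)
    return 2 * p * r / (p + r)


def auc(y_true, scores):
    """AUC: chance that a random positive outscores a random negative (ties count half)."""
    positives = [s for t, s in zip(y_true, scores) if t == 1]
    negatives = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for pos in positives:
        for neg in negatives:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def mse(actual, forecast):
    """MSE: mean of squared errors."""
    return sum((a - f) ** 2 for a, f in zip(actual, forecast)) / len(actual)


def mape(actual, forecast):
    """MAPE: mean absolute error as a percentage of the actual value (actuals must be non-zero)."""
    return 100.0 * sum(abs(a - f) / abs(a) for a, f in zip(actual, forecast)) / len(actual)


# Toy classification example (invented values).
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

# Toy regression example (invented values).
actual = [100.0, 200.0, 400.0]
forecast = [110.0, 180.0, 400.0]

print("TPR", true_positive_rate(y_true, y_pred))
print("Prec.", precision(y_true, y_pred))
print("F-1", f1_score(y_true, y_pred))
print("AUC", auc(y_true, scores))
print("MSE", mse(actual, forecast))
print("MAPE", mape(actual, forecast))
```

Each function returns a single number, which is exactly the slide's point: the score alone says nothing about the other conditions a government project has to meet.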

Six conditions for data awesomeness
• A reason for existence
• Access to the field
• Access to actionable data
• Ethical intervention points
• Methodologically defensible yet intellectually accessible
• Path to sustainability

Influence strategy and operations
Seed for innovation

Algorithmic Intelligence For New Exporters

Case: Who is export-ready and to what degree?
Unsupervised learning with a hint of supervised learning
Differentiated services for new markets

Case: A trade specialist in rural America may need to drive 2 hours to meet a potential exporter.
Conversion scoring problem
Know your utility before you go

Case: Which positions in a company are likely to use which services?
Transition probabilities
Sets expectations
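
One way to read "transition probabilities" here is as a first-order Markov estimate: given the service a client used last, how likely is each service to come next? The sketch below estimates those probabilities from observed sequences. The service names and sequences are hypothetical, and the slide does not confirm that this is the exact model the team used.

```python
from collections import Counter, defaultdict


def transition_probabilities(sequences):
    """Estimate P(next item | current item) from ordered sequences of items."""
    counts = defaultdict(Counter)
    for seq in sequences:
        # Pair each item with the one that follows it.
        for current, nxt in zip(seq, seq[1:]):
            counts[current][nxt] += 1

    probabilities = {}
    for current, next_counts in counts.items():
        total = sum(next_counts.values())
        probabilities[current] = {
            nxt: count / total for nxt, count in next_counts.items()
        }
    return probabilities


# Hypothetical histories of services used by four clients, in order.
histories = [
    ["counseling", "market research", "trade mission"],
    ["counseling", "market research"],
    ["counseling", "trade mission"],
    ["market research", "trade mission"],
]

probs = transition_probabilities(histories)
for current, row in probs.items():
    print(current, "->", row)
```

The same counting approach works if the states are job positions and the next step is the service requested, which is closer to the wording of the case.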

Upskill through data education to seed change and improvement

Start small: an experiment
4 three-hour courses taught by General Assembly

Pilot Results
422 Registrations
90% Attendance rate

Data Science I: Basics / Working with Teams (Git and GitHub) / Intro to Object-Oriented Programming (Python & JavaScript) / Using APIs (Intro to REST) / Intro to Photoshop / Intro to Python / Basic SQL (Using SQLite3) / Building APIs / Intro to R / Intro to JavaScript / Intro to Data Analysis with Python / Data Wrangling with pandas / Agile Development / HTML + CSS / Storytelling with Data / Excel / Intro to Machine Learning / Visual Analytics with Python / Data Storytelling with R

2016 Season (Scale Experiment)
14 three-hour courses taught by Commerce Data Service staff
2 two-week intensives on data science and data visualization via General Assembly
Option to be a data scientist-in-residence or data engineer-in-residence

Initial Response
3,500 registrations
15 participants for the In-Residence program
10 bureaus represented
1 model forked by another federal agency

4x more courses
6.9x growth in interest
unlimited potential

the upshot
Data skills are now a “thing”
+ there is an internal market

Commerce Data
valuable, open, big, underutilized, unused

Commerce Data Usability Project
commerce.gov/datausability

• Find the right users
• Understand security
• Find affordable housing
• Determine hail risk
• Predict rainfall and flooding
• Determine human activity using satellite data
• Help with water management

Contribute
• a novel analysis or question posed to the data
• visually arresting graphics and engagement with the public
• open, free code and data for the public to use

Income Inequality is a hard
topic to interact with…
So people don’t.

Purpose
How might we create a better ‘conversation’ and/or experience with data around income inequality?

Intention
Create a basis of knowledge for Americans on income inequality initially…
Eventually a one-stop hub for making income-related decisions combining Census and BLS data.

American Community Survey (ACS)
● Accessible via American FactFinder (AFF).
● AFF doesn’t show distributions of individuals.

Current Population Survey (CPS)
● Limits:
  ● Medians falling in the upper, open-ended interval are plugged with “$250,000”
  ● The data sets aggregate everyone above $100,000 together
  ● Limitations on job-to-job comparison
  ● Granularity of breakdowns
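
To see why an open-ended top interval limits what published medians can say, consider the sketch below. It estimates a median from binned income counts by linear interpolation within the bin, a common technique for grouped data, and falls back to a fixed plug value when the median lands in the open-ended bin. The bin edges and counts are invented; only the $250,000 plug value comes from the slide, and this is not presented as the Census Bureau's actual procedure.

```python
TOP_CODE = 250_000  # plug value reported for the open-ended interval (from the slide)


def binned_median(bins):
    """Estimate a median from (lower, upper, count) bins.

    The last bin may have upper=None, meaning it is open-ended.
    """
    total = sum(count for _, _, count in bins)
    half = total / 2
    cumulative = 0
    for lower, upper, count in bins:
        if count > 0 and cumulative + count >= half:
            if upper is None:
                # The true median is unknowable from the bins, so report the plug value.
                return TOP_CODE
            # Interpolate linearly inside the bin that contains the median.
            fraction = (half - cumulative) / count
            return lower + fraction * (upper - lower)
        cumulative += count
    raise ValueError("no observations in any bin")


# Hypothetical occupation where most people earn under $100,000.
typical = [(0, 50_000, 40), (50_000, 100_000, 40),
           (100_000, 250_000, 15), (250_000, None, 5)]

# Hypothetical occupation where most people fall in the open-ended top bin.
high_earning = [(0, 50_000, 5), (50_000, 100_000, 10),
                (100_000, 250_000, 15), (250_000, None, 70)]

print(binned_median(typical))
print(binned_median(high_earning))
```

Any two groups whose medians both fall in the top bin would report the same plug value, which is why comparisons at the high end break down.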

ACS Public Use Microdata Sample (PUMS)
● Very Rich Data Set
● Difficult To Use

The MIDAAS Project
https://midaas.commerce.gov

The lives of too many girls of color are characterized by:
Early Sexual Abuse, Chronic Aversive Stress ➪ School Failure ➪ Sexual Exploitation ➪ Prison.

Every year, girls of color are suspended from school at higher rates than any other group.
Annual Suspension Rates
12% of African-American girls
7% of Native American girls
6% of white boys
2% of white girls
Many of these girls are disproportionately funneled through the juvenile justice system.

Girls are the fastest-growing segment of the juvenile justice system.

Group                     US Population    Detained and Committed
African American girls    14%              32%
Native American girls     1%               3.5%

How Do We Use Data to Address This Problem?

Help Girls of Color
http://www.helpgirlsofcolor.org

Dr Tyrone W A Grandison
Deputy Chief Data Officer
tgrandison@doc.gov
commerce.gov/dataservice
github.com/CommerceDataService


apidays

Dubai, known for its towering skyscrapers, luxurious lifestyle, and relentless pursuit of innovation, often finds itself in the global spotlight. However, amidst the glitz and glamour, the emirate faces its own set of challenges, including the occasional threat of flooding. In recent years, Dubai has experienced sporadic but significant floods, disrupting normalcy and posing unique challenges to its infrastructure. Among the critical nodes in this bustling metropolis is the Dubai International Airport, a vital hub connecting the world. This article delves into the intersection of Dubai flood events and the resilience demonstrated by the Dubai International Airport in the face of such challenges.

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf

Orbitshub

Elevate Developer Efficiency & build GenAI Application with Amazon Q

Bhuvaneswari Subramani

The value of a flexible API Management solution for Open Banking Steve Melan, Manager for IT Innovation and Architecture - State's and Saving's Bank of Luxembourg Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - The value of a flexible API Management solution for O...

apidays

Dubai, often portrayed as a shimmering oasis in the desert, faces its own set of challenges, including the occasional threat of flooding. Despite its reputation for opulence and modernity, the emirate is not immune to the forces of nature. In recent years, Dubai has experienced sporadic but significant floods, testing the resilience of its infrastructure and communities. Among the critical lifelines in this bustling metropolis is the Dubai International Airport, a bustling hub that connects the city to the world. This article explores the intersection of Dubai flood events and the resilience demonstrated by the Dubai International Airport in the face of such challenges.

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...

Orbitshub

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

MadyBayot

CNIC Information System with Pakdata Cf In Pakistan

danishmna97

MS Copilot expands with MS Graph connectors

Nanddeep Nachan

💥 You’re lucky! We’ve found two different (lead) developers that are willing to share their valuable lessons learned about using UiPath Document Understanding! Based on recent implementations in appealing use cases at Partou and SPIE. Don’t expect fancy videos or slide decks, but real and practical experiences that will help you with your own implementations. 📕 Topics that will be addressed: • Training the ML-model by humans: do or don't? • Rule-based versus AI extractors • Tips for finding use cases • How to start 👨‍🏫👨‍💻 Speakers: o Dion Morskieft, RPA Product Owner @Partou o Jack Klein-Schiphorst, Automation Developer @Tacstone Technology

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam

UiPathCommunity

The microservices honeymoon is over. When starting a new project or revamping a legacy monolith, teams started looking for alternatives to microservices. The Modular Monolith, or 'Modulith', is an architecture that reaps the benefits of (vertical) functional decoupling without the high costs associated with separate deployments. This talk will delve into the advantages and challenges of this progressive architecture, beginning with exploring the concept of a 'module', its internal structure, public API, and inter-module communication patterns. Supported by spring-modulith, the talk provides practical guidance on addressing the main challenges of a Modultith Architecture: finding and guarding module boundaries, data decoupling, and integration module-testing. You should not miss this talk if you are a software architect or tech lead seeking practical, scalable solutions. About the author With two decades of experience, Victor is a Java Champion working as a trainer for top companies in Europe. Five thousands developers in 120 companies attended his workshops, so he gets to debate every week the challenges that various projects struggle with. In return, Victor summarizes key points from these workshops in conference talks and online meetups for the European Software Crafters, the world’s largest developer community around architecture, refactoring, and testing. Discover how Victor can help you on victorrentea.ro : company training catalog, consultancy and YouTube playlists.

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

Victor Rentea

ICT role in 21st century education and its challenges

rafiqahmad00786416

Creating a Data-Driven Government: Big Data With Purpose

1. Creating a Data-Driven Government Big Data With Purpose Dr Tyrone W A Grandison Deputy Chief Data Officer

9. << Log(radiances) >> The US as a histogram

10. dim light average light intense light radiance roughly proxies for people activity

11. commercedataservice.github.io/tutorial_viirs_part1

12. •MATRIX OF HISTOGRAMS commercedataservice.github.io/tutorial_viirs_part1

13. Two histogram comparison New York City Las Vegas commercedataservice.github.io/tutorial_viirs_part1

14. $Y(\text{Labor Force}_i) = X(\text{Radiance}_{i,j}, \ldots, \text{Radiance}_{i,n})$ commercedataservice.github.io/tutorial_viirs_part1

15.

16. Illegal fishing gas flares population ?!?!?!

17.

18.

19.

20. growth and opportunity 69,583 datasets ~ 35.9%

21.

22.

23.

24. government takes on the hardest, inelastic problems

25.

26. “What’s your stack?” “How fast is your GPU cluster in traversing the graph?” “Are you a Spark guy?”

27. Micro touch vs. long touch

28. In government, there’s a lot more to algorithmic accuracy than a score. TPR AUC F-1 Prec. MSE MAPE

29. signal + purpose

30. signal + purpose useful information

31. signal + purpose direction, meaning

32. signal + purpose viability

33. optimum

34. n-dimensional data

35. Six conditions for data awesomeness: • A reason for existence • Access to the field • Access to actionable data • Ethical intervention points • Methodologically defensible yet intellectually accessible • Path to sustainability

36. Influence strategy and operations Seed for innovation

37. 40 Projects

38. Algorithmic Intelligence For New Exporters

39.

40. Our Client, Our Goal

41. New Exporters Project

42. XX,XXX

43.

44. Case: Who is export-ready and to what degree? Unsupervised Learning with a hint of supervised learning Differentiated services for new markets

45. Case: A trade specialist in rural America may need to drive 2 hours to meet a potential exporter. Conversion Scoring Problem Know your utility before you go

46. Case: Which positions in a company are likely to use which services? Transition probabilities Sets expectations

47. We’re just getting started.

48. Data Education

49. Upskill through data education to seed for change and improvement

50. Commerce Data Academy

51. Start small: an experiment. 4 three-hour courses taught by General Assembly

52. Pilot Results 422 Registrations 90% Attendance rate

53. Data Science I: Basics / Working with Teams (Git and GitHub) / Intro to Object-Oriented Programming (Python & JavaScript) / Using APIs (Intro to REST) / Intro to Photoshop / Intro to Python / Basic SQL (Using Sqlite3) / Building APIs / Intro to R / Intro to JavaScript / Intro to Data Analysis with Python / Data Wrangling with pandas / Agile Development / HTML + CSS / Storytelling with Data / Excel / Intro to Machine Learning / Visual Analytics with Python / Data Storytelling with R

54. 2016 Season (Scale Experiment): 14 three-hour courses taught by Commerce Data Service staff; 2 two-week intensives on data science and data visualization via General Assembly; option to be a data scientist or data engineer-in-residence

55. Initial Response 3,500 Registrations 15 Participants for In-Residence program 10 Bureaus represented 1 Model forked by another federal agency

56. 4x more courses 6.9x growth in interest unlimited potential

57. the upshot Data skills are now a “thing” + there is an internal market

58. Data Usability

59. Commerce Data: valuable, open, big, under-utilized, unused

60. Commerce Data Usability Project commerce.gov/datausability

61.

62. Find the right users Understand security Find affordable housing Determine hail risk Predict rainfall and flooding Determine human activity; using satellite data Help with Water Management

63.

64. Contribute: a novel analysis or question posed to the data; visually arresting graphics and engagement with the public; open, free code and data for the public to use

65. Income Inequality

66. Income Inequality is a hard topic to interact with… So people don’t.

67. How might we create a better ‘conversation’ and/or experience with data around income inequality? purpose

68. Create a basis of knowledge for Americans on income inequality initially… Eventually a one-stop hub for making income-related decisions combining Census and BLS data. intention

69. American Community Survey (ACS) ● Accessible via American Fact Finder (AFF). ● AFF doesn’t show distributions of individuals.

70. Current Population Survey (CPS) ● Limits: ● Medians falling in the upper, open-ended interval are plugged with “$250,000” ● The data sets aggregate everyone above $100,000 together ● Limitations on job-to-job comparison ● Granularity of breakdowns

71. ACS Public Use Microdata Sample (PUMS) ● Very Rich Data Set ● Difficult To Use

72. The MIDAAS Project https://midaas.commerce.gov

73. School-to-Prison Pipeline

74. The lives of too many girls of color are characterized by: Early Sexual Abuse, Chronic Aversive Stress ➪ School Failure ➪ Sexual Exploitation ➪ Prison.

75. Every year, girls of color are suspended from school at higher rates than any other group. Annual suspension rates: 12% of African-American girls, 7% of Native American girls, 6% of white boys, 2% of white girls. Many of these girls are disproportionately funneled through the juvenile justice system.

76. Girls are the fastest growing segment of the juvenile justice system. African American girls: 14% of the US population, 32% of those detained and committed. Native American girls: 1% of the US population, 3.5% of those detained and committed.

77. How Do We Use Data to Address This Problem?

78. Help Girls of Color http://www.helpgirlsofcolor.org

79. Stay tuned.

80. Dr Tyrone W A Grandison Deputy Chief Data Officer tgrandison@doc.gov commerce.gov/dataservice github.com/CommerceDataService

Editor's Notes

On October 28, 2011, a Delta II rocket took off from Vandenberg Air Force Base in California.
Onboard was the Suomi NPP satellite, a nearly 2000 kg satellite with the mission of adding to the environmental and climate data records of the Earth; helping us to better understand society. The satellite mission was made possible by a partnership between the National Oceanic and Atmospheric Administration (NOAA) and NASA.
Onboard, NPP carries various instruments that collect information about the earth system. One particular instrument, the Visible Infrared Imaging Radiometer Suite or VIIRS -- a 277 kg imaging device -- holds the potential to understand earth in unprecedented ways.
While NPP flies in a sun-synchronous orbit, the VIIRS instrument goes to work. It can see everything from atmospheric conditions, clouds, the earth radiation budget, and clear-air land/water surfaces to sea surface temperature, ocean color, and low-light visible imagery. It also captures nighttime lights, enabling far-ranging applications.
Looking at the continental US, nighttime lights are distributed in non-random patterns.
On a macroscale, we can see all the interconnectedness of large cities to towns with the arteries in between.
We can also see activity on the high seas, with boats and oil rigs in the Gulf coast.
And it’s more than a pretty picture. It’s data. It’s big data. In fact, the US nighttime lights profile can be turned into a histogram. Think about taking a photo of the US from space using your nifty digital camera and then having a histogram of the lights. We are basically binning the light so we know how many pixels fall into each level of light intensity.
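The binning described above can be sketched in a few lines of Python. This is a minimal illustration, not the team's actual pipeline: the radiance grid is randomly generated stand-in data, and the function name `light_histogram` and the choice of 20 bins are assumptions made for the example. The log transform follows the "Log(radiances)" label on the slide.

```python
import numpy as np

# Hypothetical stand-in for a nighttime-lights raster: a 200 x 200 grid of
# positive radiance values (synthetic data, not real VIIRS output).
rng = np.random.default_rng(0)
radiance = rng.lognormal(mean=0.0, sigma=1.0, size=(200, 200))


def light_histogram(pixels, n_bins=20):
    """Count how many pixels fall into each level of log light intensity."""
    log_rad = np.log(pixels[pixels > 0])  # log is undefined for zero radiance
    counts, edges = np.histogram(log_rad, bins=n_bins)
    return counts, edges


counts, edges = light_histogram(radiance)
```

Each entry of `counts` is the number of pixels in one intensity band, from dim on the left to intense on the right.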
And that light intensity holds the potential to understand population dynamics -- we could ballpark the number of people on the ground -- allowing researchers to tie it to labor force estimates and economic output. This representation of data holds clues to how society collectively behaves. Let's put it into an example
Let’s zoom in a bit on the 35 largest metro areas in the US. See the spider web patterns and the clustering of light. That indicates patterns in urban development, sprawl, economic activity, residential activity. And using nighttime lights we can quantify it.
In fact, when we break down satellite imagery into histograms, we can see clear differences in the amount and intensity of light. Cities with less light will have smaller histograms. Cities with more light and higher population density will have a tail to the right. The more clustered the central business district is in small cities, the longer the right tail.
In New York, the light distribution has a mix of dim and bright lights. But in Las Vegas, it’s dimmer with one super bright urban core. One intensely bright pixel in one city will not mean the same as the same bright pixel in another. The clustering, residential, employment will also differ.
Our team is experimenting with ways to convert the signal into more timely measures of society and the economy. And find where we can develop derivative data series. The key to new data-driven societal insights is somewhere in that data.
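One way to picture the idea on slide 14, where labor force Y is related to a matrix X of radiance features, is an ordinary least squares fit. The sketch below is an assumption about how such a model could be set up, not the method the team uses: the data are synthetic, the five radiance bins and the weights in `true_w` are invented for illustration, and only the count of 35 metro areas comes from the talk.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical design matrix: one row per metro area, one column per
# radiance histogram bin (pixel counts). All values are synthetic.
n_areas, n_bins = 35, 5
X = rng.poisson(lam=50, size=(n_areas, n_bins)).astype(float)

# Hypothetical "true" relationship used only to generate example targets.
true_w = np.array([0.5, 1.0, 2.0, 4.0, 8.0])
y = X @ true_w + rng.normal(scale=5.0, size=n_areas)

# Ordinary least squares with an intercept column.
X1 = np.column_stack([np.ones(n_areas), X])
coef, _, _, _ = np.linalg.lstsq(X1, y, rcond=None)

# Share of variance in the synthetic labor-force values explained by the fit.
y_hat = X1 @ coef
r2 = 1.0 - ((y - y_hat) ** 2).sum() / ((y - y.mean()) ** 2).sum()
```

With real data, the fit quality would depend on how well light intensity actually tracks labor force in each metro area, which is the open question the notes describe.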
But we're certainly not the first to take a crack at it and it doesn’t take much effort to find brilliant scientists at Commerce who are finding ways to use the data. For example, Dr Chris Elvidge -- a remote sensing scientist based out of NOAA’s Boulder Research Facility -- has spent most of his career drumming up ways of using nighttime imagery.
Using VIIRS, he has found ways to detect: illegal fishing, the location and spread of wildfires and gas flares that add greenhouse emissions. Also, VIIRS can help estimate GDP and other social indicators, especially in the rural parts of developing world as well as measure the ROI of electrification projects.
The data is there. It’s collected everyday. And there is more there than many of us could imagine.
Just from the VIIRS instrument, we collect about 2.5 terabytes of raw data per day that expands out to much more when we consider all the processed data. This is what Commerce is about. We collect some of the highest value data around and find ways to use it to advance and better society and the economy.
This is what my team is about. I’m part of the leadership team of the Commerce Data Service, a new data startup within the Office of the Secretary, where I lead data science initiatives advancing the missions of the 12 bureaus of Commerce. The Data Service was established in November last year and we've been quickly growing and moving to take on some of the hard problems across the bureaus...
Bureaus like the Census Bureau, NOAA, the Patent + Trademark Office, Bureau of Economic Analysis among other agencies that produce about 36% of the federal open data available through data.gov. Essentially, we're one of the data big dogs.
As the Deputy Chief Data Officer of the US Department of Commerce, I have the extraordinary privilege of working with some of the brightest scientists and policy makers in the country.
We have satellites and radar stations that help us understand the environment.
We conduct well over 200 of the highest quality demographic and economic surveys in the world, which support research on trade, urban planning and schooling.
And it's not for nothing. I'd like to take you through what it means to work on data projects in government. Government takes on the hardest problems and we need data to take on those problems. If any one person needs help and asks for help, it’s the government that needs to step up to the challenge, whether it’s for defense, homelessness, housing, healthcare, education or the economy.
According to the Census Bureau, we have nearly 320 million Americans. That’s 320 million customers. At the Commerce Data Service, we are doing our part by helping to make government more data-driven. But given the nature of our portfolio, we have to work differently.
I often hear people start a data conversation with “What’s your stack?”, “How fast is your GPU cluster?”, “Are you a Spark guy?”. This indicates to me that someone is starting a project with technology first.
Well, the thing is, our modes of interaction with our customers are not usually through micro-touches such as purchases, likes, views. The actions of a government are mostly in long touches -- hard conversations, in person services, laws and policies to create the right conditions.
This is a hard realization for me. The first conversation a data scientist needs to have when starting a gov project is with the people out in the field. It's humbling, it's tough, but ultimately, there is more to algorithmic accuracy than the data. There’s the operational awareness. Both are equally important. We need to take a hard look at what data can actually do.
In government, data science projects need to start with conversations around signal + purpose.
Signal pertains to the substance of data. It’s about if that data even makes sense for what you want to do, if it matches the right time frames, the geographic resolution, the fidelity and reliability of the way it’s collected. There are data systems that can detect wildfires, but as amazing as it is, if it’s slightly off the decision time scale, it can’t be used. Data is an amazing national resource, but it needs to be shaped and understood.
For data to affect change, we need adoption of products. Adoption is achieved through understanding purpose. We’re here to do good. We need to have a purpose to do good.
A great mission might not have good data. Great data might not have an actionable purpose. Jointly, signal and purpose are a way to proxy for viability.
Ultimately, in government we do not have simple 1 or 2 dimensional problems,
because data is only one of n-dimensions of project when considering all else in the world.
Thus, to ensure we're doing right by the public, we've worked out a set of six conditions for data and delivery awesomeness:
• A reason for existence. Why is there a policy, program or process? How does it work? What is the system blueprint -- tech and social? This is the key for developing a theory of change.
• Access to the field. We need to speak with people who actually act on information and understand how they view new products and data. It's ultimately about them.
• Access to actionable data. We need to be able to dive quickly and deeply into the data to find signal, as a data product without signal in the data is just a pretty picture.
• Ethical intervention points. Using the social blueprint, we need to find an intervention point where a data science product would make sense.
• Methodologically defensible yet intellectually accessible. Many data scientists like to go down the path of algorithmic splendor, but we can't do that in our world as it alienates too many stakeholders. So, our work needs to be methodologically bulletproof by research standards but explainable by a generalist. Once we have buy-in, we can re-introduce that splendor.
• Path to sustainability. Lastly, projects need an endpoint or a reason to be sustainable. And this is born out of testing.
These conditions allow us to create change, influence strategy, and seed for innovation.
And we apply this to all projects in our current portfolio of 40 projects. The vast majority are in the R&D phase, but I'd like to talk about a few projects that are now in the open.
One of our efforts uses data science to help strengthen export services.
And to broaden and deepen impact, ITA and the US Commercial Service, which has trade specialists in 100 cities and 75 countries worldwide, are collaborating with the Commerce Data Service to incorporate data into their US national field strategy.
Example client
We call this the New Exporters Project and it’s an effort to experiment using data science to combine ITA’s client data with commercial data sources to find untapped markets.
In a given year, ITA reaches thousands of businesses, providing everything from business matchmaking services to market reports to company due diligence.
ITA is looking to reach far more businesses through their business disruption initiative. By fine-tuning services by customer segment, they can reach a far broader audience of businesses. Here are a few examples of what data science can do:
Think about all the companies that are export-ready and don’t know it. Using a combination of unsupervised learning and supervised learning, we’re developing fine-tuned ways of searching for untouched companies, figuring out which company types are more likely to use which types of services, and migrating to a market-wide view.
How about the trade specialist in rural America who may need to drive 2 hours to meet a potential exporter? That’s a huge time spend. We’re developing scoring models to figure out the potential utility of our services ahead of time, before that long drive. For example, smaller manufacturing facilities may be associated with lighter touch services like market reports, so an emailed report may actually be a better first step. Likewise, small to medium sized businesses with a larger market cap in certain industries may be able to afford to invest in developing international relationships.
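To make the "know your utility before you go" idea concrete, here is a toy scoring sketch. Everything in it is hypothetical: the feature names, the weights, the logistic form, and the rule that a longer drive requires a higher score are all assumptions chosen for illustration, not the models being developed with ITA.

```python
import math


def conversion_score(features, weights, bias):
    """Logistic score in (0, 1): higher means a visit is more likely to pay off."""
    z = bias + sum(weights[name] * value for name, value in features.items())
    return 1.0 / (1.0 + math.exp(-z))


def suggest_first_touch(score, drive_hours, visit_threshold=0.5):
    """Hypothetical rule: the longer the drive, the higher the score required."""
    required = min(0.95, visit_threshold + 0.1 * drive_hours)
    return "in-person visit" if score >= required else "emailed market report"


# Hypothetical weights and company features, invented for this example.
weights = {"employees_log": 0.8, "prior_contact": 1.2}
company = {"employees_log": 1.0, "prior_contact": 0.0}

score = conversion_score(company, weights, bias=-1.0)
first_touch = suggest_first_touch(score, drive_hours=2.0)
```

In practice the weights would be learned from historical client data rather than set by hand, and the decision rule would come from the trade specialists in the field.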
Which positions in a company will use which services? It may be that different positions in a given company ask for one service over another, but creating a rule of thumb is a statistical research problem. Having biz dev in a title may be associated with more light touches. A CEO title may actually be a wildcard. So, having a good lead-off offering could be the difference between use and non-use.
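A simple way to estimate such probabilities is to count how often each position has used each service and normalize per position. The sketch below assumes a list of (position, service) records; the records themselves and the function name `transition_probabilities` are hypothetical, and real client data would need far more care (sample sizes, confounding, privacy).

```python
from collections import Counter, defaultdict

# Hypothetical (position, service) usage records, invented for illustration.
records = [
    ("biz dev", "market report"),
    ("biz dev", "market report"),
    ("biz dev", "matchmaking"),
    ("ceo", "due diligence"),
    ("ceo", "market report"),
]


def transition_probabilities(pairs):
    """Estimate P(service | position) from observed usage counts."""
    counts = defaultdict(Counter)
    for position, service in pairs:
        counts[position][service] += 1
    return {
        position: {s: n / sum(c.values()) for s, n in c.items()}
        for position, c in counts.items()
    }


probs = transition_probabilities(records)
```

With these toy records, two of the three biz dev interactions are market reports, while the CEO records are split evenly, which mirrors the "wildcard" intuition in the notes.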
Exporting is clearly a Commerce priority. We’re just getting started.
One of the priorities at Commerce is data education and upskilling – both internally and externally.
More data skills will improve efficiency. The smallest behavioral change may scale. So, at Commerce, we’ve launched an internal initiative called the Commerce Data Academy.
Back in December, the Data Service launched the Commerce Data Academy to show what’s possible through data.
We started with a pilot of 4 three-hour classes taught by General Assembly.
And as it was a pilot, we didn’t think that we would end up with 422 registrations and a 90% attendance rate. Who would’ve thought?
We then started to think… what if we went big? Hail Mary it, and expand the offering to cover JavaScript, machine learning, and basic programming.
And we scaled it to 14 three-hour classes taught by our Data Service staff, with 2 two-week-long intensives taught by General Assembly.
We’ve seen a huge bump. Now we have 3,500 registrations. In addition, the 10 most committed public servants from the Academy are now on detail with our shop to exercise those new skills to build products and capacity for their home agencies. This model has worked out so well that at least one other agency has forked the CDA model.
Four times more courses led to 6.9x growth in interest, which tells us that there is unlimited potential to disrupt the skills space.
The upshot is that showing our skills in the open has established data skills as a “thing” within the Department of Commerce, and there is now a new internal market for data products.
Another area we are focusing on is data usability.
Commerce has some of the most highly valued data sets. Unfortunately, they are often under-utilized or unused, primarily because they are difficult to find, hard to understand and even harder to process (because many do not understand the collection constraints involved in the production of the data).
Usability of data is dependent on the context, examples, and compelling purpose. And to help open data move to open knowledge, we’re stepping up our game. We launched the Commerce Data Usability Project to publish long form tutorials that illustrate data use cases, code, and narrative around high-value, high potential data. And it's targeted at undergraduate and graduate students -- the next generation of data scientists who are hungry to learn. We’ve partnered with private sector companies, academia, and nonprofits to show how data is being used around the country.
We have a nice bench of contributors and more always coming. - Mapbox has contributed two tutorials on how to get started with interactive web maps using NOAA Global Weather Forecast data; - Zillow has produced a tutorial on analyzing housing affordability combining their data and Census data; - Earth Genome illustrated how to manipulate digital elevation model data that plays a key role in wetlands models.
We are highlighting the power of contextualizing and illuminating #OpenData. How many people here believe that #OpenData can currently help them find their customers and users? The Commerce Data Service provides very specific detail on doing just that using data from the Census American Community Survey (ACS). See http://commercedataservice.github.io/tutorial_acs_rank/. #OpenData from the Department can help businesses understand their computer security (http://commercedataservice.github.io/tutorial_nist_nvd/), find affordable housing options for their employees (http://commercedataservice.github.io/tutorial_zillow_acs/), help them determine weather risk (http://commercedataservice.github.io/tutorial_noaa_hail/), help predict rainfall and flooding issues (http://commercedataservice.github.io/tutorial_mapbox_part1/), help them determine hotbeds of human activity – using satellite data (http://commercedataservice.github.io/tutorial_viirs_part1/ ), and to help them with water management concerns (http://commercedataservice.github.io/tutorial_earthgenome/)
In the coming weeks, Microsoft and Columbia University have signed up to release a series of tutorials on how to begin to use analytical tools. Many more are to come and we welcome collaborations. There is agreement out there that a product gets used if people are furnished with a basic understanding of what that product is. In data and tech, free and balanced education really is a powerful tool. More and more organizations want to show how open data works for them.
Our tutorials are designed to engage data audiences, encourage adoption of datasets and associated workflows, and facilitate innovation. To do this, we’ve ensured that all tutorials are built according to the following guidelines: a novel analysis or question posed to the data; visually arresting graphics; and open and free code and data for the public to use. It is important to note that we are language, method, and approach agnostic. This is what you have to do if you want to contribute to the initiative.
Income Inequality is one of the formidable challenges of our time.
However, it is a hard topic … and not many people talk about or interact with it because of this.
Our purpose was to use data to create a better conversation around this topic.
We want to create a data-driven platform to focus on this issue. The first thing we have to do is examine the data sources.
The ACS does not have the detail that we require.
The Census Current Population Survey (CPS) has limitations that preclude us from having a conversation on the detailed data. These limitations include: medians falling in the upper, open-ended interval are plugged with “$250,000”; the data sets aggregate everyone above $100,000 together; limitations on job-to-job comparison; and the granularity of breakdowns.
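A small example shows why aggregating everyone above $100,000 limits the conversation. In the sketch below, two made-up income samples differ greatly at the top but produce identical binned counts. Only the $100,000 cutoff comes from the notes; the lower bin edges and the income values are hypothetical.

```python
import numpy as np

# Hypothetical lower bin edges; only the $100,000 top cutoff is from the talk.
# Bins: [0, 25k), [25k, 50k), [50k, 100k), [100k, infinity)
inner_edges = [25_000, 50_000, 100_000]

# Two invented income samples that differ only above $100,000.
incomes_a = np.array([20_000, 40_000, 80_000, 120_000, 150_000])
incomes_b = np.array([20_000, 40_000, 80_000, 400_000, 900_000])


def binned_counts(incomes):
    """Count people per income bin, with one open-ended top bin."""
    idx = np.digitize(incomes, inner_edges)
    return np.bincount(idx, minlength=len(inner_edges) + 1)


counts_a = binned_counts(incomes_a)
counts_b = binned_counts(incomes_b)
same_after_binning = bool(np.array_equal(counts_a, counts_b))
```

Both samples collapse to the same counts even though their top incomes are very different, so any question about the upper end of the distribution cannot be answered from the aggregated tables alone.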
The PUMS is the data that we chose to use. It is a very rich data set: individual and household data sets, income breakdowns by type, job breakdown by industry, and geographic breakdown below the state level. It is also difficult to use: the USA individual file alone is 2 Excel files, the data dictionary is 138 pages, and there are very specific ways to match variables that are difficult to understand.
MIDAAS is an API and website that unpacks the ACS PUMS data and creates a forum for us to have that discussion.
Another issue is the School-to-Prison pipeline.
We’re just warming up. That’s just a few of the 40 projects. Big ones on the way. Stay tuned.
