SlideShare uma empresa Scribd logo
1 de 10
LEXISNEXIS,
NCSTATE
OPPORTUNITIES
TIM MENZIES
COMPUTER SCIENCE,
JUNE 2015
SEBIG LAB : SE FOR BIG DATA
• Three year partnership
• New lab to explore SE methods for big data apps.
• Grow skill set of engineers:
• Assess different approaches to Big Data
• Validation of results
2
LAB PROCESSES VS
INDUSTRIAL PROCESSES
• Lab processes
• Make 10ml of
oxygen?
• Easy!
• Make 100,000 liters
per day?
• That’s another matter
3
INDUSTRIAL PROCESSES FOR
DATA MINING
4
INDUSTRIAL PROCESSES
FOR DATA MINING
5
1
23
4
5
EXPLORING NEW ALGORITHMS
• New ideas
• SVM
• Deep learning
• Ensembles
• etc
• Visualizations
• Parameter tuning
• Synonym discovery
• Incremental association
rule learning
6
1
VALIDATION STUDIES
• Independent checks of industrial results
• Optimizing validation:
• ? Mechanical Turk
• Better support tools for coding new
functionality
• Better test suites for
certifying new functionality
7
2
CAN WE MAKE BETTER
USE OF OLD KNOWLEDGE?
• Learning domain ontologies.
• Corpus definition.
• How to revise old knowledge?
• The privileged review problem.
• Transfer learning.
8
3
SUPPORT
Gather case study data
Synthetic studies
Annonymization of data
Training
• Papers
• Tutorials
• Learning information
seeking behavior
9
4
LESS IS MORE
• Reasoning via fewer, most representative
examples
• Active learning
• Early stopping
• Stack ranking (early stop)
10
5

Mais conteúdo relacionado

Semelhante a Lexisnexis june9

RFonseca HEV Intern Presentation
RFonseca HEV Intern PresentationRFonseca HEV Intern Presentation
RFonseca HEV Intern Presentation
Rogelio Fonseca
 
WINGLO CLOUD BASED LIMS - Comprehensive Solutions for Small medium and Lar...
WINGLO  CLOUD BASED LIMS  - Comprehensive Solutions for  Small medium and Lar...WINGLO  CLOUD BASED LIMS  - Comprehensive Solutions for  Small medium and Lar...
WINGLO CLOUD BASED LIMS - Comprehensive Solutions for Small medium and Lar...
Innowave Healthcare
 
Cockerill rs350-day3-what-can-be-done
Cockerill rs350-day3-what-can-be-doneCockerill rs350-day3-what-can-be-done
Cockerill rs350-day3-what-can-be-done
Riffyn
 
'How To Apply Lean Test Management' by Bob van de Burgt
'How To Apply Lean Test Management' by Bob van de Burgt'How To Apply Lean Test Management' by Bob van de Burgt
'How To Apply Lean Test Management' by Bob van de Burgt
TEST Huddle
 

Semelhante a Lexisnexis june9 (20)

Bdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchenBdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchen
 
Exploratory Testing Basics and Future
Exploratory Testing Basics and FutureExploratory Testing Basics and Future
Exploratory Testing Basics and Future
 
MongoDB World 2019: Analytics with MongoDB: The Data Warehouse You Didn't Kno...
MongoDB World 2019: Analytics with MongoDB: The Data Warehouse You Didn't Kno...MongoDB World 2019: Analytics with MongoDB: The Data Warehouse You Didn't Kno...
MongoDB World 2019: Analytics with MongoDB: The Data Warehouse You Didn't Kno...
 
CenturyLink's Game-Changing Strategies for Increasing Project Quality
CenturyLink's Game-Changing Strategies for Increasing Project QualityCenturyLink's Game-Changing Strategies for Increasing Project Quality
CenturyLink's Game-Changing Strategies for Increasing Project Quality
 
RFonseca HEV Intern Presentation
RFonseca HEV Intern PresentationRFonseca HEV Intern Presentation
RFonseca HEV Intern Presentation
 
Exploratory Testing Kari Kakkonen BTD 2017
Exploratory Testing Kari Kakkonen BTD 2017Exploratory Testing Kari Kakkonen BTD 2017
Exploratory Testing Kari Kakkonen BTD 2017
 
Neo4j GraphDay Munich - How to make your GraphDB project successful
Neo4j GraphDay Munich - How to make your GraphDB project successfulNeo4j GraphDay Munich - How to make your GraphDB project successful
Neo4j GraphDay Munich - How to make your GraphDB project successful
 
Ane Clausen - Success with Automated Regression Test revised
Ane Clausen - Success with Automated Regression Test revisedAne Clausen - Success with Automated Regression Test revised
Ane Clausen - Success with Automated Regression Test revised
 
WINGLO CLOUD BASED LIMS - Comprehensive Solutions for Small medium and Lar...
WINGLO  CLOUD BASED LIMS  - Comprehensive Solutions for  Small medium and Lar...WINGLO  CLOUD BASED LIMS  - Comprehensive Solutions for  Small medium and Lar...
WINGLO CLOUD BASED LIMS - Comprehensive Solutions for Small medium and Lar...
 
Webinar: "5 semplici passi per migliorare la Quality e i processi di Test".
Webinar: "5 semplici passi per migliorare la Quality e i processi di Test".Webinar: "5 semplici passi per migliorare la Quality e i processi di Test".
Webinar: "5 semplici passi per migliorare la Quality e i processi di Test".
 
Use Case: Airbus and Process Mining Technology
Use Case: Airbus and Process Mining TechnologyUse Case: Airbus and Process Mining Technology
Use Case: Airbus and Process Mining Technology
 
Quantifying DevOps Adoption Empirically for Demonstrable ROI
Quantifying DevOps Adoption Empirically for Demonstrable ROIQuantifying DevOps Adoption Empirically for Demonstrable ROI
Quantifying DevOps Adoption Empirically for Demonstrable ROI
 
Kellogg's gets it all with EPI-USE Labs' Managed SAP® Data Refresh Services
Kellogg's gets it all with EPI-USE Labs' Managed SAP® Data Refresh ServicesKellogg's gets it all with EPI-USE Labs' Managed SAP® Data Refresh Services
Kellogg's gets it all with EPI-USE Labs' Managed SAP® Data Refresh Services
 
What can be done to improve reproducibility?
What can be done to improve reproducibility?What can be done to improve reproducibility?
What can be done to improve reproducibility?
 
Cockerill rs350-day3-what-can-be-done
Cockerill rs350-day3-what-can-be-doneCockerill rs350-day3-what-can-be-done
Cockerill rs350-day3-what-can-be-done
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
'How To Apply Lean Test Management' by Bob van de Burgt
'How To Apply Lean Test Management' by Bob van de Burgt'How To Apply Lean Test Management' by Bob van de Burgt
'How To Apply Lean Test Management' by Bob van de Burgt
 
Exploratory Testing Basics and Future Kari Kakkonen Moscow 2018
Exploratory Testing Basics and Future Kari Kakkonen Moscow 2018Exploratory Testing Basics and Future Kari Kakkonen Moscow 2018
Exploratory Testing Basics and Future Kari Kakkonen Moscow 2018
 
Webinar : Nouveautés de MongoDB 3.2
Webinar : Nouveautés de MongoDB 3.2Webinar : Nouveautés de MongoDB 3.2
Webinar : Nouveautés de MongoDB 3.2
 
New trends in Agile Test Management
New trends in Agile Test ManagementNew trends in Agile Test Management
New trends in Agile Test Management
 

Mais de CS, NcState

Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab template
CS, NcState
 
Automated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSUAutomated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSU
CS, NcState
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
CS, NcState
 

Mais de CS, NcState (20)

Talks2015 novdec
Talks2015 novdecTalks2015 novdec
Talks2015 novdec
 
Future se oct15
Future se oct15Future se oct15
Future se oct15
 
GALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software EngineeringGALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software Engineering
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
 
Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).
 
Icse15 Tech-briefing Data Science
Icse15 Tech-briefing Data ScienceIcse15 Tech-briefing Data Science
Icse15 Tech-briefing Data Science
 
Kits to Find the Bits that Fits
Kits to Find  the Bits that Fits Kits to Find  the Bits that Fits
Kits to Find the Bits that Fits
 
Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab template
 
Automated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSUAutomated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSU
 
Requirements Engineering
Requirements EngineeringRequirements Engineering
Requirements Engineering
 
172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia
 
Automated Software Engineering
Automated Software EngineeringAutomated Software Engineering
Automated Software Engineering
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)
 
Tim Menzies, directions in Data Science
Tim Menzies, directions in Data ScienceTim Menzies, directions in Data Science
Tim Menzies, directions in Data Science
 
Goldrush
GoldrushGoldrush
Goldrush
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
 
Know thy tools
Know thy toolsKnow thy tools
Know thy tools
 
The Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software DataThe Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software Data
 
What Metrics Matter?
What Metrics Matter? What Metrics Matter?
What Metrics Matter?
 

Último

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Último (20)

How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 

Lexisnexis june9

  • 2. SEBIG LAB : SE FOR BIG DATA • Three year partnership • New lab to explore SE methods for big data apps. • Grow skill set of engineers: • Assess different approaches to Big Data • Validation of results 2
  • 3. LAB PROCESSES VS INDUSTRIAL PROCESSES • Lab processes • Make 10ml of oxygen? • Easy! • Make 100,000 liters per day? • That’s another matter 3
  • 5. INDUSTRIAL PROCESSES FOR DATA MINING 5 1 23 4 5
  • 6. EXPLORING NEW ALGORITHMS • New ideas • SVM • Deep learning • Ensembles • etc • Visualizations • Parameter tuning • Synonym discovery • Incremental association rule learning 6 1
  • 7. VALIDATION STUDIES • Independent checks of industrial results • Optimizing validation: • ? Mechanical Turk • Better support tools for coding new functionality • Better test suites for certifying new functionality 7 2
  • 8. CAN WE MAKE BETTER USE OF OLD KNOWLEDGE? • Learning domain ontologies. • Corpus definition. • How to revise old knowledge? • The privileged review problem. • Transfer learning. 8 3
  • 9. SUPPORT Gather case study data Synthetic studies Annonymization of data Training • Papers • Tutorials • Learning information seeking behavior 9 4
  • 10. LESS IS MORE • Reasoning via fewer, most representative examples • Active learning • Early stopping • Stack ranking (early stop) 10 5