Challenges in Software
Ecosystems Research
Alexander
Serebrenik
Eindhoven University
of Technology
The Netherlands @aserebrenik
Tom Mens UMons Belgium @tom_mens
Software
ecosystems in
scientific literature
[Chart: yearly counts, 1996-2014, y-axis 0-500; series: Scholar full text, DBLP titles]
Future challenges?
Definition of an ecosystem
Example of an ecosystem
Trends and challenges
164
authors of an article or a book
chapter on SECO, paper in
IWSECO, WEA or Big Systems 2014
141 authors with a valid email address
26* answered the survey
* response rate 18.4%, comparable with other surveys
Definition of an ecosystem
Respondent: “Defining everything as an
ecosystem. <…> The word is trend-ish and
it causes misunderstandings in the field.”
“The complex system of plant, animal, fungal, and
microorganism communities and their associated
non-living environment interacting as an ecological
unit. Ecosystems have no fixed boundaries”
[Lungu 2008]
[Jansen et al.
2009]
[Manikas,
Hansen 2013]
<biological>
communities
software
projects
actors actors
environment environment
shared market for
software and
services, shared
platform
common
technological
platform
interaction
developed and
evolve together
exchange of
information,
resources &
artefacts
symbiotic
relationships
Definition of an ecosystem
social
technical economical
[Lungu 2008]
[Bosch, Bosch-Sijtsema 2009]
[*Moore 1993][Jansen et al. 2009]
[Mitleton-Kelly 2003]
[Manikas,
Hansen
2013]
Definition of an ecosystem
companies
app stores
OS foundations
programming
languages
operating
systems
forges & social
ecosystems
Example of an ecosystem
Based on the
literature
Definition of an ecosystem
Example of an ecosystem
Respondent: “Defining
everything as an
ecosystem. <…> The
word is trend-ish and it
causes misunderstandings
in the field.”
social
economical technical
Different perspectives on the same artefacts or different
artefacts altogether?
Trends and challenges
26 survey answers Literature study
29 challenges
8 categories
“One challenge is to be able to characterize
the wealth of the community wrt the wealth
of the software components. What is the
impact of different collaboration and
development practices on the quality of the
ecosystem?”
Trends and challenges
ecosystem quality
socio-technical
SECOs may consist of many systems.
Analysing all these systems as a whole
may raise some technical problems, due to
the quantity of data to take into account.
data analytics
amount (volume)
large databases with comparable information about the details
of a large collection of ecosystems, so that any research could
be conducted in a repeatable and comparable way.
database of comparable info
reproducible research
Software Ecosystems
are/lead to Big Data
~
male
likes games
NYC
Privacy: digital
trace data
Privacy: surveys
Minority respondents
are easy to identify
Reproducibility vs privacy
      Non-sensitive                  Sensitive
 #    Zip     Age    Nationality     Condition
 1    13053   28     Russian         Heart Disease
 2    13068   29     American        Heart Disease
 3    13068   21     Japanese        Viral Infection
 4    13053   23     American        Viral Infection
 5    14853   50     Indian          Cancer
 6    14853   55     Russian         Heart Disease
 7    14850   47     American        Viral Infection
 8    14850   49     American        Viral Infection
 9    13053   31     American        Cancer
10    13053   37     Indian          Cancer
11    13068   36     Japanese        Cancer
12    13068   35     American        Cancer
      Non-sensitive                  Sensitive
 #    Zip     Age    Nationality     Condition
 1    130**   <30    *               Heart Disease
 2    130**   <30    *               Heart Disease
 3    130**   <30    *               Viral Infection
 4    130**   <30    *               Viral Infection
 5    1485*   >40    *               Cancer
 6    1485*   >40    *               Heart Disease
 7    1485*   >40    *               Viral Infection
 8    1485*   >40    *               Viral Infection
 9    130**   30-40  *               Cancer
10    130**   30-40  *               Cancer
11    130**   30-40  *               Cancer
12    130**   30-40  *               Cancer
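The masked table can be checked mechanically. The slides do not name the technique, so reading it as k-anonymisation over the non-sensitive columns is an assumption, and the function names below are illustrative only. The sketch groups rows by their masked non-sensitive values, reports the smallest group size, and reports the smallest number of distinct conditions inside any group.

```python
from collections import defaultdict

# Rows copied from the masked table: (zip, age, nationality, condition).
ROWS = (
    [("130**", "<30", "*", c) for c in
     ("Heart Disease", "Heart Disease", "Viral Infection", "Viral Infection")]
    + [("1485*", ">40", "*", c) for c in
       ("Cancer", "Heart Disease", "Viral Infection", "Viral Infection")]
    + [("130**", "30-40", "*", "Cancer")] * 4
)


def equivalence_classes(rows):
    """Group the sensitive values by the masked non-sensitive columns."""
    groups = defaultdict(list)
    for *quasi_identifiers, sensitive in rows:
        groups[tuple(quasi_identifiers)].append(sensitive)
    return dict(groups)


def k_anonymity(rows):
    """Size of the smallest group of indistinguishable rows."""
    return min(len(v) for v in equivalence_classes(rows).values())


def distinct_l_diversity(rows):
    """Smallest number of distinct sensitive values within any group."""
    return min(len(set(v)) for v in equivalence_classes(rows).values())


print(k_anonymity(ROWS))           # 4
print(distinct_l_diversity(ROWS))  # 1
```

Every row is indistinguishable from at least three others, yet the 30-40 group contains only one condition, so knowing that a person falls in that group still reveals the condition. This illustrates why masking alone may not settle the reproducibility versus privacy trade-off.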
Are some challenges more
important than others?
Second survey
• Group A: respondents of the previous survey that
have provided their email addresses
• 26 answers - 20 with mail, invited - 14 responses - 70%
• Group B: extended list of ecosystem experts
(outside Group A):
• 148 invited - 142 valid addresses - 38* responses ~ 27%
• Better response rate: 32.1% vs 18.4% (first survey)
* One of the respondents that provided an email has not been invited
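The percentages quoted for both surveys can be recomputed from the counts on the slides. This is a small arithmetic sketch; it assumes the pooled second-survey rate is taken over invited people with valid addresses (20 + 142), which is the reading that reproduces the 32.1% figure.

```python
def response_rate(responses, invited):
    """Response rate as a percentage of invited people."""
    return 100.0 * responses / invited


first_survey = response_rate(26, 141)               # valid addresses, first survey
group_a = response_rate(14, 20)                     # previous respondents
group_b = response_rate(38, 142)                    # extended expert list
second_survey = response_rate(14 + 38, 20 + 142)    # both groups pooled

for name, value in [("first survey", first_survey), ("group A", group_a),
                    ("group B", group_b), ("second survey", second_survey)]:
    print(f"{name}: {value:.1f}%")
```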
No difference between
Group A and Group B
Adonis, Unknown,
restored by Duquesnoy
(1597–1643), Louvre
• Analysis of Similarities
(ANOSIM)
• R: -0.07564
• more dissimilar closer to 1
• Permutational Multivariate
Analysis of Variance Using
Distance Matrices (ADONIS)
• p-value: 0.192
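For readers unfamiliar with ANOSIM, the R statistic compares the ranks of between-group and within-group dissimilarities: R = (mean between rank - mean within rank) / (n(n-1)/4). The following is a minimal pure-Python sketch, not the analysis used for the slides. The Manhattan distance, the toy data, and all function names are assumptions chosen for illustration.

```python
import random
from itertools import combinations


def average_ranks(values):
    """Rank values starting at 1; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def anosim_r(dist, labels):
    """ANOSIM R from a dict {(i, j): dissimilarity} with i < j."""
    n = len(labels)
    pairs = list(combinations(range(n), 2))
    ranks = average_ranks([dist[p] for p in pairs])
    within = [r for (i, j), r in zip(pairs, ranks) if labels[i] == labels[j]]
    between = [r for (i, j), r in zip(pairs, ranks) if labels[i] != labels[j]]
    mean_within = sum(within) / len(within)
    mean_between = sum(between) / len(between)
    return (mean_between - mean_within) / (n * (n - 1) / 4)


def anosim(points, labels, permutations=999, seed=0):
    """Return (R, permutation p-value) using Manhattan distance (assumed)."""
    n = len(points)
    dist = {(i, j): sum(abs(a - b) for a, b in zip(points[i], points[j]))
            for i, j in combinations(range(n), 2)}
    observed = anosim_r(dist, labels)
    rng = random.Random(seed)
    shuffled = list(labels)
    hits = 0
    for _ in range(permutations):
        rng.shuffle(shuffled)  # break any link between answers and group
        if anosim_r(dist, shuffled) >= observed:
            hits += 1
    return observed, (hits + 1) / (permutations + 1)


# Toy data: two clearly separated groups, so R should be at its maximum.
toy_points = [(0,), (1,), (2,), (10,), (11,), (12,)]
toy_labels = ["A", "A", "A", "B", "B", "B"]
r_value, p_value = anosim(toy_points, toy_labels)
print(r_value, p_value)
```

An R close to 0, such as the -0.07564 reported on the slide, means that answers within a group are about as dissimilar as answers across groups.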
Ordering challenges
1. Consider both groups as one set of answers
2. Per question: #very important - #moderately
important - #slightly important
3. Lexicographic order on the triples
(#very important - #moderately important - #slightly
important)
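The ordering procedure above can be sketched as a sort on tuples, since Python compares tuples lexicographically. The challenge names and counts below are invented for illustration, and descending order (most "very important" votes first) is assumed.

```python
def rank_challenges(counts):
    """Order challenges by (#very, #moderately, #slightly), highest first.

    counts maps a challenge name to its triple of answer counts.
    """
    return sorted(counts, key=lambda name: counts[name], reverse=True)


# Hypothetical counts, not survey data.
example_counts = {
    "challenge X": (30, 10, 5),
    "challenge Y": (30, 12, 2),
    "challenge Z": (28, 20, 4),
}
ranking = rank_challenges(example_counts)
print(ranking)  # ['challenge Y', 'challenge X', 'challenge Z']
```

X and Y tie on the first component, so the second component decides; Z has more "moderately important" votes than either but loses on the first component.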
Top Three
1. Reproducible and Comparable Research [Providing
databases with information about the details of a
large collection of ecosystems]
2. Reproducible and Comparable Research [Making
research results about ecosystems available in a
reproducible way]
3. Offer more advanced ecosystems analysis (e.g., case
studies, qualitative and quantitative analysis) [Use
more advanced statistical techniques (e.g., survival
analysis, econometric aggregation, contrasts)]
4. Understanding and improving the design, architecture, quality and
health of software ecosystems [Socio-technical perspective, e.g.,
comparing the health of the community against the health of the
ecosystem components]
5. Ecosystem Governance [Design perspective, e.g., actively
supporting the stakeholders' decisions]
6. Understanding and improving an ecosystem's dynamics and
evolution [Generalisation perspective, e.g., transferring insights from
evolution of individual software systems to evolution of ecosystems]
7. Understanding and improving the design, architecture, quality and
health of software ecosystems [Social perspective, e.g., creating an
active community around the ecosystem]
8. Interdisciplinary research [Applying ecosystem research techniques
to non-classical software ecosystems, e.g., spreadsheets or Matlab
Simulink models]
9. Understanding and improving an ecosystem's dynamics and
evolution [Design perspective, e.g., providing upgrade strategies
when one of the ecosystem elements changes]
10. Ecosystem Governance [Generalisation perspective, e.g., going
beyond anecdotal evidence]
Reproducible Research: SE
problem?
Raw data   Processed data set   Tools & scripts   # MSR papers 2004-2009
Y          Y                    Y                  2
Y          Y                    N                  2
Y          P                    Y                  1
Y          P                    P                  2
Y          P                    N                  2
Y          N                    Y                 16
Y          N                    P                 19
Y          N                    N                 64
P          N                    Y                  1
P          N                    N                  2
N          Y                    N                  2
N          P                    N                  1
N          N                    Y                  7
N          N                    P                  2
N          N                    N                 31
N/A        N/A                  N/A               17
Robles 2010
Ghezzi, Gall 2013:
• Replicated 25
• Partially 27
• Not replicated 36
Reproducible and Comparable Research
[Providing databases with information about the
details of a large collection of ecosystems]
Enough?
Too big to share?
Up-to-date?
Still relevant?
1TB
Culture
http://www.nickcobbcopywriter.com/wp-content/uploads/2013/03/whats-in-it-for-me.jpg
Advanced statistics
3. Offer more advanced ecosystems analysis (e.g., case
studies, qualitative and quantitative analysis) [Use more
advanced statistical techniques (e.g., survival
analysis, econometric aggregation, contrasts)]
Advanced statistics
PAGE 27 11/08/15
Two distributions:
• t-test
• Mann-Whitney
Multiple distributions:
1. ANOVA / KW
2. pairwise t-test / MW
Tests can be
inconsistent with
each other
We need a
one-phase test!
Advanced statistics
Idea:
Pair   Low     High
B-A    -0.56   -0.44
C-A    -0.50   -0.31
D-A    -0.32   -0.03
C-B    -0.01    0.24
D-B     0.24    0.47
D-C     0.09    0.40
A→B
A→C
A→D
D→B
D→C
Konietschke, F., Hothorn, LA, and Brunner, E.
Rank-based multiple test procedures and
simultaneous confidence intervals.
Electron. J. Stat. 6 (2012), 738–759.
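The arrows listed on the slide appear to follow from the intervals: a pair gets an arrow only when its interval excludes zero, and the sign decides the direction. That reading, and the convention that "X→Y" puts the group with larger values first, are assumptions. The sketch below only post-processes the intervals shown; it does not compute them.

```python
# Simultaneous confidence intervals from the slide, keyed by (first, second)
# for a pair written "first-second".
INTERVALS = {
    ("B", "A"): (-0.56, -0.44),
    ("C", "A"): (-0.50, -0.31),
    ("D", "A"): (-0.32, -0.03),
    ("C", "B"): (-0.01, 0.24),
    ("D", "B"): (0.24, 0.47),
    ("D", "C"): (0.09, 0.40),
}


def arrows(intervals):
    """Derive directed relations from intervals that exclude zero."""
    result = []
    for (first, second), (low, high) in intervals.items():
        if high < 0:
            # Entire interval negative: 'second' dominates 'first'.
            result.append(f"{second}→{first}")
        elif low > 0:
            # Entire interval positive: 'first' dominates 'second'.
            result.append(f"{first}→{second}")
        # Otherwise the interval contains zero: no conclusion for this pair.
    return result


relations = arrows(INTERVALS)
print(relations)  # ['A→B', 'A→C', 'A→D', 'D→B', 'D→C']
```

The C-B interval contains zero, so no arrow is drawn between B and C. Because all intervals are simultaneous, the conclusions cannot contradict each other the way a global test followed by pairwise tests can.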
T̃ and Software Ecosystems
• Stack Overflow and GitHub - Vasilescu et al. SocialCom 2013
• Simulink models - Dajsuren et al. QoSA 2013
• GNOME - Vasilescu et al. ESE 2014
• Stack Exchange sites - Wang et al. ICSME 2014
• jEdit, ArgoUML, KOffice - Sun et al. Inf & Software
Technology 2015
Advanced statistics
Mean,
median,
sum
Gini, Theil,
Kolm…
Choice of an aggregation
technique provides different
insights but can also affect
validity of the results!
C. Gini, “Measurement of inequality of
incomes,” The Economic Journal, 1921.
H. Theil, Economics and Information Theory.
North-Holland, 1967
A.B. Atkinson, “On the measurement of
inequality,” Journal of Economic Theory,
1970.
…
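To make the point about aggregation concrete, here is a small sketch of two of the inequality indices named above, using their textbook definitions: the Gini coefficient as the mean absolute difference divided by twice the mean, and the Theil index as the mean of (x/μ)·ln(x/μ). The metric values are invented; both samples have the same mean and sum but very different inequality.

```python
import math


def gini(values):
    """Gini coefficient: mean absolute difference divided by twice the mean."""
    n = len(values)
    mean = sum(values) / n
    total = sum(abs(a - b) for a in values for b in values)
    return total / (2 * n * n * mean)


def theil(values):
    """Theil index; terms with value 0 contribute 0 (limit of x*ln x)."""
    n = len(values)
    mean = sum(values) / n
    return sum((v / mean) * math.log(v / mean) for v in values if v > 0) / n


# Hypothetical per-file metric values for two systems.
evenly_spread = [5, 5, 5, 5]
concentrated = [0, 0, 0, 20]

for sample in (evenly_spread, concentrated):
    mean = sum(sample) / len(sample)
    print(mean, gini(sample), theil(sample))
```

The mean reports 5.0 for both samples, while Gini and Theil separate them clearly, which is the sense in which the choice of aggregation changes the insight obtained.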
Gini, Theil & Software Ecos
• Qualitas - Spasojević et al. ICSME 2014
• GNOME - Mens, Goeminne IWSECO 2011,
Vasilescu et al. ESE 2014
• Debian - Serebrenik, vd Brand ICSM 2010
• Market shares - Yu, First Monday 2012
Advanced statistics
% of entities still used
after time t?
Kaplan, E. L.; Meier, P. (1958).
"Nonparametric estimation from incomplete
observations". J. Amer. Statist. Assn. 53
(282): 457–481
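The question "% of entities still used after time t?" is what the Kaplan-Meier estimator answers when some entities are still in use at the end of the observation period (censored). A minimal sketch of the product-limit formula S(t) = Π (1 - d_i / n_i) follows; the durations and the function name are illustrative assumptions, not data from the cited studies.

```python
def kaplan_meier(durations, observed):
    """Return [(t, S(t))] for every time t at which an event was observed.

    durations: time until the entity was abandoned or observation ended.
    observed:  True if abandonment was seen, False if censored.
    """
    event_times = sorted({t for t, seen in zip(durations, observed) if seen})
    survival = 1.0
    curve = []
    for t in event_times:
        # Entities still under observation just before time t.
        at_risk = sum(1 for d in durations if d >= t)
        # Entities whose abandonment is observed exactly at time t.
        events = sum(1 for d, seen in zip(durations, observed)
                     if d == t and seen)
        survival *= 1 - events / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical lifetimes (in years); the third entity is censored at year 2.
curve = kaplan_meier([1, 2, 2, 3], [True, True, False, True])
for time, share in curve:
    print(time, round(share, 3))
```

The censored entity counts as "at risk" up to year 2 but never as an event, which is how the estimator uses incomplete observations instead of discarding them.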
Survival & Software Ecos
• FLOSSMetrics DB - Samoladas et al. Information &
Software Technology 2010
• Debian packages - Claes et al. MSR 2015
• Databases in Java projects - Goeminne, Mens
ICSME 2015
Threats to validity
• Representativeness of the respondents wrt the
research community
National Oceanic and Atmospheric Administration, USA

%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
tonesoftg
tonesoftgtonesoftg
tonesoftg
 

Challenges in Software Ecosystem Research

  • 1. Challenges in Software Ecosystems Research Alexander Serebrenik Eindhoven University of Technology The Netherlands @aserebrenik Tom Mens UMons Belgium @tom_mens
  • 4. Definition of an ecosystem Example of an ecosystem Trends and challenges
  • 5. 164 authors of an article or a book chapter on SECO, or of a paper in IWSECO, WEA or Big Systems 2014; 141 authors with a valid email address; 26* answered the survey. * Response rate 18.4%, comparable with other surveys
  • 6. Definition of an ecosystem Respondent: “Defining everything as an ecosystem. <…> The word is trend-ish and it causes misunderstandings in the field.”
  • 7. “The complex system of plant, animal, fungal, and microorganism communities and their associated non-living environment interacting as an ecological unit. Ecosystems have no fixed boundaries”
  • 8. Definition of an ecosystem
        <biological>: communities; environment; interaction
        [Lungu 2008]: software projects; environment; developed and evolve together
        [Jansen et al. 2009]: actors; shared market for software and services, shared platform; exchange of information, resources & artefacts
        [Manikas, Hansen 2013]: actors; common technological platform; symbiotic relationships
  • 9. Definition of an ecosystem: social, technical, economical. [Lungu 2008] [Bosch, Bosch-Sijtsema 2009] [*Moore 1993] [Jansen et al. 2009] [Mitleton-Kelly 2003] [Manikas, Hansen 2013]
  • 10. Example of an ecosystem (based on the literature): companies, app stores, OS foundations, programming languages, operating systems, forges & social ecosystems
  • 15. Definition of an ecosystem. Example of an ecosystem. Respondent: “Defining everything as an ecosystem. <…> The word is trend-ish and it causes misunderstandings in the field.” Social, economical, technical: different perspectives on the same artefacts or different artefacts altogether?
  • 16. Trends and challenges: 26 survey answers and the literature study yield 29 challenges in 8 categories
  • 19. Trends and challenges. “One challenge is to be able to characterize the wealth of the community wrt the wealth of the software components. What is the impact of different collaboration and development practices on the quality of the ecosystem?” [ecosystem quality; socio-technical] SECOs may consist of many systems. Analysing all these systems as a whole may raise some technical problems, due to the quantity of data to take into account. [data analytics; amount (volume)] Large databases with comparable information about the details of a large collection of ecosystems, so that any research could be conducted in a repeatable and comparable way. [database of comparable info; reproducible research]
  • 24. Privacy: surveys. Minority respondents are easy to identify. Reproducibility vs privacy.
  • 25. Non-sensitive: Zip, Age, Nationality. Sensitive: Condition.
        #   Zip    Age  Nationality  Condition
        1   13053  28   Russian      Heart Disease
        2   13068  29   American     Heart Disease
        3   13068  21   Japanese     Viral Infection
        4   13053  23   American     Viral Infection
        5   14853  50   Indian       Cancer
        6   14853  55   Russian      Heart Disease
        7   14850  47   American     Viral Infection
        8   14850  49   American     Viral Infection
        9   13053  31   American     Cancer
        10  13053  37   Indian       Cancer
        11  13068  36   Japanese     Cancer
        12  13068  35   American     Cancer
  • 26. Non-sensitive: Zip, Age, Nationality. Sensitive: Condition.
        #   Zip    Age    Nationality  Condition
        1   130**  <30    *            Heart Disease
        2   130**  <30    *            Heart Disease
        3   130**  <30    *            Viral Infection
        4   130**  <30    *            Viral Infection
        5   1485*  >40    *            Cancer
        6   1485*  >40    *            Heart Disease
        7   1485*  >40    *            Viral Infection
        8   1485*  >40    *            Viral Infection
        9   130**  30-40  *            Cancer
        10  130**  30-40  *            Cancer
        11  130**  30-40  *            Cancer
        12  130**  30-40  *            Cancer
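
The generalisation shown in the two tables can be sketched in a few lines of Python. This is only an illustration: the records are copied from the slide, while the function names (`generalize`, `k_anonymity`, `min_sensitive_diversity`) and the exact generalisation rules are assumptions chosen to reproduce the masked table.

```python
from collections import Counter, defaultdict

# Rows copied from the slide: (zip, age, nationality, condition).
RECORDS = [
    ("13053", 28, "Russian", "Heart Disease"),
    ("13068", 29, "American", "Heart Disease"),
    ("13068", 21, "Japanese", "Viral Infection"),
    ("13053", 23, "American", "Viral Infection"),
    ("14853", 50, "Indian", "Cancer"),
    ("14853", 55, "Russian", "Heart Disease"),
    ("14850", 47, "American", "Viral Infection"),
    ("14850", 49, "American", "Viral Infection"),
    ("13053", 31, "American", "Cancer"),
    ("13053", 37, "Indian", "Cancer"),
    ("13068", 36, "Japanese", "Cancer"),
    ("13068", 35, "American", "Cancer"),
]


def generalize(zip_code, age, nationality):
    """Hypothetical rules that reproduce the masked table on the slide."""
    zip_g = "130**" if zip_code.startswith("130") else zip_code[:4] + "*"
    if age < 30:
        age_g = "<30"
    elif age <= 40:
        age_g = "30-40"
    else:
        age_g = ">40"
    return (zip_g, age_g, "*")  # nationality is suppressed entirely


def k_anonymity(quasi_identifiers):
    """Size of the smallest group of rows sharing the same quasi-identifiers."""
    return min(Counter(quasi_identifiers).values())


def min_sensitive_diversity(rows):
    """Smallest number of distinct sensitive values within any group."""
    groups = defaultdict(set)
    for *quasi, condition in rows:
        groups[tuple(quasi)].add(condition)
    return min(len(conditions) for conditions in groups.values())


raw_k = k_anonymity([r[:3] for r in RECORDS])
masked = [generalize(*r[:3]) + (r[3],) for r in RECORDS]
masked_k = k_anonymity([m[:3] for m in masked])
masked_diversity = min_sensitive_diversity(masked)
print(raw_k, masked_k, masked_diversity)
```

On this data every raw row is unique, while the masked table groups rows four at a time. One masked group (rows 9 to 12) contains a single condition, so group membership alone still reveals the sensitive value.
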
  • 28. Are some challenges more important than others?
  • 29. Second survey • Group A: respondents of the previous survey that have provided their email addresses: 26 answers, 20 with email (invited), 14 responses (70%) • Group B: extended list of ecosystem experts (outside Group A): 148 invited, 142 valid addresses, 38* responses (~27%) • Better response rate: 32.1% vs 18.4% (first survey) * One of the respondents that provided an email has not been invited
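
The response rates quoted for the two surveys can be checked with simple arithmetic. The sketch below assumes that the 32.1% figure pools both groups of the second survey (responses over valid invitations); the helper name `response_rate` is made up for illustration.

```python
def response_rate(responses, invited):
    """Response rate in percent."""
    return 100.0 * responses / invited


first_survey = response_rate(26, 141)             # 26 answers from 141 valid addresses
group_a = response_rate(14, 20)                   # second survey, Group A
group_b = response_rate(38, 142)                  # second survey, Group B
second_survey = response_rate(14 + 38, 20 + 142)  # both groups pooled (assumption)

for name, rate in [("first survey", first_survey), ("group A", group_a),
                   ("group B", group_b), ("second survey", second_survey)]:
    print(f"{name}: {rate:.1f}%")
```
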
  • 31. No difference between Group A and Group B • Analysis of Similarities (ANOSIM): R = -0.07564 (groups are more dissimilar the closer R is to 1) • Permutational Multivariate Analysis of Variance Using Distance Matrices (ADONIS): p-value 0.192 [Image: Adonis, Unknown, restored by Duquesnoy (1597–1643), Louvre]
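
To make the ANOSIM statistic concrete, here is a minimal pure-Python sketch. It assumes the usual formulation R = (mean rank of between-group distances - mean rank of within-group distances) / (n(n-1)/4) with a label-permutation p-value. It is not the authors' analysis: the function names and the toy data are hypothetical.

```python
import random
from itertools import combinations


def average_ranks(values):
    """Ranks starting at 1, with ties receiving their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def anosim_r(dist, labels):
    """ANOSIM R for a square distance matrix and one group label per item."""
    n = len(labels)
    pairs = list(combinations(range(n), 2))
    ranks = average_ranks([dist[i][j] for i, j in pairs])
    within = [r for r, (i, j) in zip(ranks, pairs) if labels[i] == labels[j]]
    between = [r for r, (i, j) in zip(ranks, pairs) if labels[i] != labels[j]]
    mean_within = sum(within) / len(within)
    mean_between = sum(between) / len(between)
    return (mean_between - mean_within) / (n * (n - 1) / 4)


def anosim_p(dist, labels, permutations=999, seed=0):
    """Permutation p-value: how often shuffled labels give an R this large."""
    rng = random.Random(seed)
    observed = anosim_r(dist, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(permutations):
        rng.shuffle(shuffled)
        if anosim_r(dist, shuffled) >= observed:
            hits += 1
    return (hits + 1) / (permutations + 1)


# Toy data (hypothetical): two clearly separated groups on a line.
points = [0, 1, 2, 10, 11, 12]
labels = ["A", "A", "A", "B", "B", "B"]
dist = [[abs(p - q) for q in points] for p in points]
r_value = anosim_r(dist, labels)
p_value = anosim_p(dist, labels)
print(r_value, p_value)
```

For perfectly separated groups, as in the toy data, R reaches 1. A value near 0, like the one on the slide, means within-group and between-group distances rank about the same.
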
  • 32. Ordering challenges 1. Consider both groups as one set of answers 2. Per question: #very important - #moderately important - #slightly important 3. Lexicographic order on the triples (#very important - #moderately important - #slightly important)
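
The ordering procedure can be sketched as a sort on count triples. The challenge names and counts below are hypothetical, and sorting in descending order (most "very important" votes first) is an assumption about the direction of the lexicographic order.

```python
from typing import NamedTuple


class Tally(NamedTuple):
    challenge: str
    very: int         # number of "very important" answers
    moderately: int   # number of "moderately important" answers
    slightly: int     # number of "slightly important" answers


def rank_challenges(tallies):
    """Lexicographic order on (very, moderately, slightly), largest first."""
    return sorted(
        tallies,
        key=lambda t: (t.very, t.moderately, t.slightly),
        reverse=True,
    )


# Hypothetical counts, for illustration only.
tallies = [
    Tally("challenge X", 20, 25, 5),
    Tally("challenge Y", 30, 10, 8),
    Tally("challenge Z", 30, 12, 2),
]
ranking = [t.challenge for t in rank_challenges(tallies)]
print(ranking)
```

Challenges Y and Z tie on the first component, so the second component decides between them. The third component only matters when the first two are equal.
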
  • 33. Top Three 1. Reproducible and Comparable Research [Providing databases with information about the details of a large collection of ecosystems] 2. Reproducible and Comparable Research [Making research results about ecosystems available in a reproducible way] 3. Offer more advanced ecosystems analysis (e.g., case studies, qualitative and quantitative analysis) [Use more advanced statistical techniques (e.g., survival analysis, econometric aggregation, contrasts)]
  • 34. 4. Understanding and improving the design, architecture, quality and health of software ecosystems [Socio-technical perspective, e.g., comparing the health of the community against the health of the ecosystem components] 5. Ecosystem Governance [Design perspective, e.g., actively supporting the stakeholders' decisions] 6. Understanding and improving an ecosystem's dynamics and evolution [Generalisation perspective, e.g., transferring insights from evolution of individual software systems to evolution of ecosystems] 7. Understanding and improving the design, architecture, quality and health of software ecosystems [Social perspective, e.g., creating an active community around the ecosystem] 8. Interdisciplinary research [Applying ecosystem research techniques to non-classical software ecosystems, e.g., spreadsheets or Matlab Simulink models] 9. Understanding and improving an ecosystem's dynamics and evolution [Design perspective, e.g., providing upgrade strategies when one of the ecosystem elements changes] 10. Ecosystem Governance [Generalisation perspective, e.g., going beyond anecdotal evidence]
  • 35. Reproducible Research: SE problem?
        Raw data  Processed data set  Tools & scripts  # MSR papers 2004–2009
        Y         Y                   Y                2
        Y         Y                   N                2
        Y         P                   Y                1
        Y         P                   P                2
        Y         P                   N                2
        Y         N                   Y                16
        Y         N                   P                19
        Y         N                   N                64
        P         N                   Y                1
        P         N                   N                2
        N         Y                   N                2
        N         P                   N                1
        N         N                   Y                7
        N         N                   P                2
        N         N                   N                31
        N/A       N/A                 N/A              17
        Robles 2010
        Ghezzi, Gall 2013: • Replicated 25 • Partially 27 • Not replicated 36
  • 36. Reproducible and Comparable Research [Providing databases with information about the details of a large collection of ecosystems] Enough? Too big to share? Up-to-date? Still relevant? 1TB
  • 38. Advanced statistics 3. Offer more advanced ecosystems analysis (e.g., case studies, qualitative and quantitative analysis) [Use more advanced statistical techniques (e.g., survival analysis, econometric aggregation, contrasts)]
  • 39. Advanced statistics. Two distributions: t-test, Mann-Whitney. Multiple distributions: 1. ANOVA / Kruskal-Wallis (KW) 2. pairwise t-test / Mann-Whitney (MW). Tests can be inconsistent with each other. We need a one-phase test!
  • 40. Advanced statistics. Idea:
        Pair  Low    High
        B-A   -0.56  -0.44
        C-A   -0.50  -0.31
        D-A   -0.32  -0.03
        C-B   -0.01   0.24
        D-B    0.24   0.47
        D-C    0.09   0.40
        Resulting edges: A→B, A→C, A→D, D→B, D→C (the pairs whose interval excludes 0; the C-B interval contains 0, so it yields no edge).
        Konietschke, F., Hothorn, LA, and Brunner, E. Rank-based multiple test procedures and simultaneous confidence intervals. Electron. J. Stat. 6 (2012), 738–759.
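
One way to read the interval table is as a rule that turns simultaneous confidence intervals into a directed graph. The sketch below encodes that reading. The interpretation (an edge points from the group with the larger values to the one with the smaller values whenever the interval excludes 0) is inferred from the numbers on the slide, and `edges_from_intervals` is a hypothetical helper, not part of any published tool.

```python
# Intervals copied from the slide: (first, second) means "first minus second".
INTERVALS = {
    ("B", "A"): (-0.56, -0.44),
    ("C", "A"): (-0.50, -0.31),
    ("D", "A"): (-0.32, -0.03),
    ("C", "B"): (-0.01, 0.24),
    ("D", "B"): (0.24, 0.47),
    ("D", "C"): (0.09, 0.40),
}


def edges_from_intervals(intervals):
    """Return (source, target) edges for every interval that excludes 0."""
    edges = []
    for (first, second), (low, high) in intervals.items():
        if high < 0:    # first is smaller than second
            edges.append((second, first))
        elif low > 0:   # first is larger than second
            edges.append((first, second))
        # otherwise the interval contains 0: no evidence of a difference
    return edges


edges = edges_from_intervals(INTERVALS)
print(" ".join(f"{source}→{target}" for source, target in edges))
```

Under this reading the code reproduces the five edges listed on the slide, and the pair C-B produces none.
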
  • 41. T̃ and Software Ecosystems • Stack Overflow and GitHub - Vasilescu et al. SocialCom 2013 • Simulink models - Dajsuren et al. QoSA 2013 • GNOME - Vasilescu et al. ESE 2014 • Stack Exchange sites - Wang et al. ICSME 2014 • jEdit, ArgoUML, KOffice - Sun et al. Inf & Software Technology 2015
  • 42. Advanced statistics. Mean, median, sum; Gini, Theil, Kolm… Choice of an aggregation technique provides different insights but can also affect validity of the results! C. Gini, “Measurement of inequality of incomes,” The Economic Journal, 1921. H. Theil, Economics and Information Theory. North-Holland, 1967. A.B. Atkinson, “On the measurement of inequality,” Journal of Economic Theory, 1970. …
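
The point about aggregation techniques can be illustrated with a small sketch that compares the mean with the Gini and Theil indices. The formulas are the standard textbook ones; the metric values are hypothetical (for example, lines of code per file in two systems).

```python
import math


def gini(values):
    """Gini index: 0 for perfect equality, close to 1 for extreme inequality."""
    xs = sorted(values)
    n, total = len(xs), sum(xs)
    if total == 0:
        return 0.0
    weighted = sum(i * x for i, x in enumerate(xs, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


def theil(values):
    """Theil index: 0 for perfect equality, at most ln(n)."""
    n = len(values)
    mean = sum(values) / n
    if mean == 0:
        return 0.0
    # Terms with x == 0 contribute 0 (the limit of x * ln x as x goes to 0).
    return sum((x / mean) * math.log(x / mean) for x in values if x > 0) / n


# Hypothetical metric values for two systems with the same mean.
evenly_spread = [5, 5, 5, 5]
concentrated = [0, 0, 0, 20]

for name, data in [("evenly spread", evenly_spread), ("concentrated", concentrated)]:
    mean = sum(data) / len(data)
    print(f"{name}: mean={mean:.1f} gini={gini(data):.2f} theil={theil(data):.3f}")
```

Both systems have the same mean, so the mean alone cannot distinguish them. The inequality indices separate them clearly.
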
  • 43. Gini, Theil & Software Ecos • Qualitas - Spasojević et al. ICSME 2014 • GNOME - Mens, Goeminne IWSECO 2011, Vasilescu et al. ESE 2014 • Debian - Serebrenik, vd Brand ICSM 2010 • Market shares - Yu, First Monday 2012
  • 44. Advanced statistics % of entities still used after time t? Kaplan, E. L.; Meier, P. (1958). "Nonparametric estimation from incomplete observations". J. Amer. Statist. Assn. 53 (282): 457–481
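
The question "% of entities still used after time t?" is what the Kaplan-Meier estimator answers. Below is a minimal pure-Python sketch of the product-limit formula S(t) = Π (1 - d_i / n_i). The observations are hypothetical, and in practice a dedicated survival-analysis library would normally be used instead.

```python
def kaplan_meier(observations):
    """Kaplan-Meier survival curve.

    observations: list of (duration, event_observed) pairs, where
    event_observed is False for censored entities (still in use when
    observation stopped). Returns a list of (time, survival probability).
    """
    event_times = sorted({t for t, observed in observations if observed})
    survival = 1.0
    curve = []
    for t in event_times:
        # Entities still under observation just before time t.
        at_risk = sum(1 for duration, _ in observations if duration >= t)
        # Entities that stopped being used exactly at time t.
        events = sum(1 for duration, observed in observations
                     if duration == t and observed)
        survival *= 1 - events / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical data: time until a package is dropped; False = still in use.
observations = [(1, True), (2, True), (2, False), (3, True)]
curve = kaplan_meier(observations)
for time, probability in curve:
    print(f"t={time}: {probability:.2f}")
```

The censored entity counts as at risk up to time 2 and is dropped afterwards. This is how the estimator uses incomplete observations instead of discarding them.
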
  • 45. Survival & Software Ecos • FLOSSMetrics DB - Samoladas et al. Information & Software Technology 2010 • Debian packages - Claes et al. MSR 2015 • Databases in Java projects - Goeminne, Mens ICSME 2015
  • 46. Threats to validity • Representativeness of the respondents wrt the research community
  • 48. National Oceanic and Atmospheric Administration, USA