SlideShare uma empresa Scribd logo
1 de 22
Data management for CRP DS research: 
where do we currently stand at ICARDA? 
• CO: CGIAR Open Access and Data Management 
Plans & Implementation (Article 4.1.9) states 
“Open Access and Data Management Plans 
should be prepared in order to ensure 
implementation of this Policy. Such Plans shall, in 
particular, outline a strategy for maximizing 
opportunities to make information products Open 
Access”. 
• Output: Research quality and data quality issues 
in CRP DS research and mechanism/workflow
Data management for CRP DS research: 
where do we currently stand at ICARDA? 
Plan 
•Sources of data under CRP DS 
•Status of DM at ICARDA 
•CRP DS research areas for data 
generation 
•Issues and solutions related to Research 
quality and data quality 
•Workflow for DM sharing
Scope: Sources of data 
Scope of work is determined by observing a 
complex interplay of 
•Base components: crops, livestock, rangelands, 
trees etc. & production systems 
× 
•Biophysical environment constraints: water 
scarcity, land degradation 
× 
•Technological access : Access to the product 
and regulatory environment
Partners in the DM (OA) 
• Who generates the data? Who owns them? Who regulates 
their sharing? 
Outcome: What after archiving with an Open Assess (OA) 
System? 
• Data mining 
• Exploration of large or even BIG data leading to a wider 
picture viewed from the bridge 
• No dearth of random factors/sources in data 
• Availability of prior information 
• Bayesian analysis to span the statistical inference domain 
to reality
CRP DS DM Current Status 
DM Status at ICARDA /Its Flagships Target 
Regions 
•ICARDA Projects: D and DM with scientists, 
archived in their laptops, different various 
locations/countries 
•GU data on Central servers, Amman, Jordan 
•D Manager to be recruited 
•NARS data with NARS
ICARDA Projects dealing with CRP DS 
ICARDA Programs: DSIPS, IWLM, SEPR, BIGM 
(also generating data for other CRPs) 
1. Cropping systems and Agronomy on-station 
and on-farm 
•On-station Trials 
– Single factor, multi-factors including: 
– systems of rotations, intercrop, monocrops 
– crop components 
– fertilizer input 
– IPDM controls and other management factors
CRP DS DM - Research quality 
On-farm Trials: 
• Less frequent: research for technology 
generation 
– Experimental design with small number of 
treatments, small blocks, variable treatment 
designated as control or farmer-technology, 
relatively large number of replications 
•Most frequent: technology verification and 
demonstration 
•Sampling design: large plots, small number of 
sample is a concern
ICARDA Projects dealing with CRP DS 
• A list of data sources for CRP DS and other ICARDA 
projects: 
• Design of crop rotation trials [general] 
• DM and Analyses of data from the 2-course long-term 
wheat rotations (productivity, sustainability aspects 
including time-trend estimation). [NAWA: Long-term 
crop rotation trials on wheat & Barley at Tel Hadya, 
Syria, Long-term wheat rotation trial at Kamishly, Syria, 
Long-term sustainability trials in Egypt, etc.] 
• Evaluation of conservation tillage data [CA trials in 
Jordan and Iraq] 
• Analyses of data from livestock evaluation experiments 
[Long-term trials, wheat & Barley at Tel Hadya]
ICARDA Projects dealing with CRP DS 
ICARDA Outlook: 
•Decentralization of ICARDA has changed the way we do 
our business. 
•Archiving data and sharing has [essentially] become the 
way of our business. 
•We need to extend [quality] data sharing from within 
ICARDA to Public. 
NARS/ five Flagship target regions 
•1) The West African Sahel and dry savannas , 2) East and 
Southern Africa, 3) North Africa and West Asia, 4) Central Asia, 
5) South Asia
ICARDA Projects dealing with CRP DS: 
Key to DQ 
Research quality (RQ) 
•Experimental design could be an issue (in terms of blocking and replications) 
•Approach/Solution: thorough discussion with subject matter specialists and 
biometrician/statistician 
•Resources for enhancing RQ and DQ: 
• 
•JNR Jeffers (1978). Statistical Checklist: Design of Experiments No. 1 (Statistical checklists). Institute of 
Terrestrial Ecology, Natural Environment Research Council, Cambridge, UK. 
http://www.sawleystudios.co.uk/jnrj/StatisticalCheck/Design.htm) 
• 
•JNR Jeffers (1979). Sampling (Statistical Checklist 2). Institute of Terrestrial Ecology, Natural 
Environment Research Council, Cambridge, UK. 
http://www.sawleystudios.co.uk/jnrj/StatisticalCheck/Sampling.htm 
• 
•David J. Finney (1990). Statistical data-their care and maintenance. Indian Society of Agricultural 
Statistics. 
•“This bulletin is extremely useful for students and research workers … topics dealt with are: acquisition of 
data, design of data gathering , care for data, types and units of data analysis and databases, copying, 
statistical ethics, data-entry to the computer, data scrutiny, integrity and some illustrations.”
ICARDA Projects dealing with CRP DS 
Examples of Data quality issues: 
•Experimental design accepted; crop management properly followed. 
• Experimental plots: plot size, harvested areas [2-row, 3-row, 4-row plots], 
calculation per hectare basis 
•Days to 50% flowering- how many plants were actually observed? 
•plant height (cm)- number of plants 
•seed yield, bio yield – area used; drying methods 
•Data entry? 
•Lack of Data recording electronic devices and transfer to file at laptop 
– Early days: field-books 
– Recent: Android Apps etc. 
– Data in Excel worksheet 
•What checks should we perform? 
•What should be the level of Experimental data quality for public sharing
ICARDA Projects dealing with CRP DS 
2. Crop Improvement CRPs (CRP Wheat, CRP DC, CRP GL) 
•Single factor- Crop varieties 
•Unreplicated designs for test materials + replicated or repeated 
checks 
•Replicated variety trials in RCB, IBD (alpha-designs), p-rep designs 
•METs (Multi-environments/Multi-location and multi-year trials) 
•Two-factor experiments 
– Crops + Crop varieties 
– Sometimes agronomic trials – planting dates, IPDMs etc. 
•Result outputs from Commodity CRPs, where breeding is the key 
component (CRP Wheat, CRP DC, CRP GL) flow to CRP DS. 
Where are the data? 
•Data with scientists in their laptops 
•Status in relation to DM(OA)/sharing is unknown to me
ICARDA Projects dealing with CRP DS 
3. Issues of the Poor Quality of Data- Indicators and 
resolves 
A frequent issue of data quality 
•Is really something wrong with my data? Some statistical 
procedures work and some others do not, BUT the data are 
the same. Regression, GLM works but ANOVA does not. 
What is wrong with Stats? 
•ANOVA may turn to be a great tool for data checking, -- 
missing values in data variables may be the reality. 
•How about missing or repeats by mistake in a factor levels or 
factorial combinations?
ICARDA Projects dealing with CRP DS 
Some cases of data quality issues: 
•1. Research Quality– experimental design OK but Data on Design not 
OK/ design factors incorrectly entered; frequently encountered; Must 
be corrected before analysis else we have carried out a study different 
from what we planned and still think. 
– factor combinations not aligning with design (not missing 
observations) 
•2. Observed data values; traits values: errors of recording/data 
transfers to files 
– values out of range (a variable to lie within 0-100 or 0-1 goes outside; 
recording error) 
– Outliers/ recorded values appear too extreme. Will require validation 
with the assistant/scientists and if errors are found then must be 
corrected; generally viewed as the context of uni-variate analysis. 
– Outliers may have issues of interpretation and detection. Looks outlier 
in BY but not in log(BY) or sqrt(BY). There might be multivariate 
outliers. A column of remarks, possibly in the field book may support 
the recorded data.
ICARDA Projects dealing with CRP DS 
Some cases of data quality issues… continued 
•3. Relationships between the traits appearing along the crop 
development cycle may also be identified and used to build in data 
quality 
• DAF << DMAT 
• GY << BY 
•4. Helpful: Electronic data loggers (balance, Android Apps, with 
GIS/Date) 
•5. Role of the scientist/ a data supervisor must be made effective— 
random checks on data recording in the field book as well as in the 
file. Observations should be validated by another researchers 
experienced in the same discipline, particularly with visual scores. 
Random checks could be more effective. Data errors could be linked 
to the observer.
ICARDA Projects dealing with CRP DS 
Some checks and balances: 
•Data care bulletin (see References) 
Tools: 
•Design experiment/survey specific tools 
(Biometrician/Statistician to Data Manager). Clearly define 
the roles. 
•Examine factors combinations appearing in the data 
•Examine tables/cross-tables for qualitative data 
•Descriptive statistics 
• min, max, range, ratio=max/min (min>0) 
• Histograms 
•Box-plots and other diagnostics
ICARDA Projects dealing with CRP DS 
Some checks and balances … continued. 
•No go with ANOVA may turn to be a good thing to check 
bad data. However, as in above, 
– Missing values in response/covariate variables are a reality 
– But missing a factor level or factor combinations appear due to 
data entry error; combinations being different from those in the 
design. 
– Cases of repeated units – data entry errors 
•Outliers, if detected via a model fitting should stay in the 
data. Of course data validation, where possible, is 
encouraged.
ICARDA Projects dealing with CRP DS 
Some checks and balances...continued: 
•Benefiting from ICRISAT Tools and Techniques 
• on data checking tools 
• archiving the data on public platforms (an 
enforcer of Data Quality) 
• e.g. data systems from ICRISAT, Dataverse ( 
http://dvn.iq.harvard.edu/dvn/) 
•Computing tools/procedures: Training and development 
•Excel macros, Genstat/SAS/SPSS/R/other software 
•Database development/datasheet preparation/ archiving
ICARDA Projects dealing with CRP DS 
An attractive specialization: 
•Data Science 
•The Data Scientist’s Toolbox: 
https://www.coursera.org/course/datascitoolbox
ICARDA Projects dealing with CRP DS 
Crystalizing an approach: 
4. CRP: Dryland Systems Management: Workflow 
components 
Home Center: Project/Meta data: Project ID, objectives, location, 
year, personnel (Planner, M&E team, data collector etc.), trial level 
information, factors (design and treatment), variables etc., A report of 
data validation in Step 2; links to data; 
•Data <<<< validation (via agreed tools) 
•Mechanism for Data Quality Check 
•1. Scientists >> >2. Statistician/DM team: apply the agreed tools 
• a) If fails-----> (1) to scientist for update 
• b) If passes----> Get metadata and links to data 
•2. Archiving (what? who will do this? DM Team?) 
• Sharing permissions etc. This could be a Workflow of permissions: 
Requester ---> Approval 1--->Approval 2 ---…---> Director CRP DS/nominee.
ICARDA Projects dealing with CRP DS 
…..continued: 
•Information Management. This refers to the [statistically 
analysed] results files/publications generated. 
•Knowledge Management: Key findings, Implications, 
lessons learned 
NARS 
•Identify the active NARS partners 
•Training on the above tools and workflow, Share Policy 
and Procedure on CRP DS DM (OA) 
•Identify the risk factors and their indicators and develop 
an action plan with resources required 
•Measure and Monitor the impact
Thank you

Mais conteúdo relacionado

Semelhante a Where do we currently stand at ICARDA?

The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...israel edem
 
Data Collection Preparation
Data Collection PreparationData Collection Preparation
Data Collection PreparationBusiness Student
 
Cork big data_analytics-de_puy_synthes_datsci_cit
Cork big data_analytics-de_puy_synthes_datsci_citCork big data_analytics-de_puy_synthes_datsci_cit
Cork big data_analytics-de_puy_synthes_datsci_citDana Brophy
 
Coping with Data for WHOI JP Students
Coping with Data for WHOI JP StudentsCoping with Data for WHOI JP Students
Coping with Data for WHOI JP StudentsCarly Strasser
 
Data Management Lab: Session 2 slides
Data Management Lab: Session 2 slidesData Management Lab: Session 2 slides
Data Management Lab: Session 2 slidesIUPUI
 
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docxDATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docxrandyburney60861
 
Quartesian capabilities-2013
Quartesian capabilities-2013Quartesian capabilities-2013
Quartesian capabilities-2013Benjamin Jackson
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aRai University
 
Ensuring data quality
Ensuring data qualityEnsuring data quality
Ensuring data qualityIUPUI
 
Multi variate presentation
Multi variate presentationMulti variate presentation
Multi variate presentationArun Kumar
 
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptxXanGwaps
 
Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...
Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...
Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...Perficient
 
Saksham Sarode - Building Effective test Data Management in Distributed Envir...
Saksham Sarode - Building Effective test Data Management in Distributed Envir...Saksham Sarode - Building Effective test Data Management in Distributed Envir...
Saksham Sarode - Building Effective test Data Management in Distributed Envir...TEST Huddle
 
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Joel Saltz
 
Making powerful science: an introduction to NGS data analysis
Making powerful science: an introduction to NGS data analysisMaking powerful science: an introduction to NGS data analysis
Making powerful science: an introduction to NGS data analysisAdamCribbs1
 
Lec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrustLec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrustMenchita Falcutila Dumlao
 

Semelhante a Where do we currently stand at ICARDA? (20)

The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
 
Data Collection Preparation
Data Collection PreparationData Collection Preparation
Data Collection Preparation
 
Cork big data_analytics-de_puy_synthes_datsci_cit
Cork big data_analytics-de_puy_synthes_datsci_citCork big data_analytics-de_puy_synthes_datsci_cit
Cork big data_analytics-de_puy_synthes_datsci_cit
 
Coping with Data for WHOI JP Students
Coping with Data for WHOI JP StudentsCoping with Data for WHOI JP Students
Coping with Data for WHOI JP Students
 
Data Management Lab: Session 2 slides
Data Management Lab: Session 2 slidesData Management Lab: Session 2 slides
Data Management Lab: Session 2 slides
 
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docxDATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
 
CDISC-CDASH
CDISC-CDASHCDISC-CDASH
CDISC-CDASH
 
Quartesian capabilities-2013
Quartesian capabilities-2013Quartesian capabilities-2013
Quartesian capabilities-2013
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation a
 
Amy Driskell - Information management and data Quality
Amy Driskell - Information management and data QualityAmy Driskell - Information management and data Quality
Amy Driskell - Information management and data Quality
 
Ensuring data quality
Ensuring data qualityEnsuring data quality
Ensuring data quality
 
Multi variate presentation
Multi variate presentationMulti variate presentation
Multi variate presentation
 
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
 
Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...
Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...
Leveraging Oracle's Life Sciences Data Hub to Enable Dynamic Cross-Study Anal...
 
Saksham Sarode - Building Effective test Data Management in Distributed Envir...
Saksham Sarode - Building Effective test Data Management in Distributed Envir...Saksham Sarode - Building Effective test Data Management in Distributed Envir...
Saksham Sarode - Building Effective test Data Management in Distributed Envir...
 
Challenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL modelChallenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL model
 
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
 
Making powerful science: an introduction to NGS data analysis
Making powerful science: an introduction to NGS data analysisMaking powerful science: an introduction to NGS data analysis
Making powerful science: an introduction to NGS data analysis
 
Harmel - Monitoring to Support and Improve H/WQ Modeling
Harmel - Monitoring to Support and Improve H/WQ ModelingHarmel - Monitoring to Support and Improve H/WQ Modeling
Harmel - Monitoring to Support and Improve H/WQ Modeling
 
Lec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrustLec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrust
 

Mais de CGIAR Research Program on Dryland Systems

Response of Mature Olive Trees - Switching from Conventional to Drip irrigation
Response of Mature Olive Trees - Switching from Conventional to Drip irrigationResponse of Mature Olive Trees - Switching from Conventional to Drip irrigation
Response of Mature Olive Trees - Switching from Conventional to Drip irrigationCGIAR Research Program on Dryland Systems
 
Nadira and Dina final gender ds presentation ncare meeting and planning of 2...
Nadira and Dina final gender ds  presentation ncare meeting and planning of 2...Nadira and Dina final gender ds  presentation ncare meeting and planning of 2...
Nadira and Dina final gender ds presentation ncare meeting and planning of 2...CGIAR Research Program on Dryland Systems
 

Mais de CGIAR Research Program on Dryland Systems (20)

Impact of Different Levels of Supplemental Irrigation on Olive Productivity
Impact of Different Levels of Supplemental Irrigation on Olive ProductivityImpact of Different Levels of Supplemental Irrigation on Olive Productivity
Impact of Different Levels of Supplemental Irrigation on Olive Productivity
 
Response of Mature Olive Trees - Switching from Conventional to Drip irrigation
Response of Mature Olive Trees - Switching from Conventional to Drip irrigationResponse of Mature Olive Trees - Switching from Conventional to Drip irrigation
Response of Mature Olive Trees - Switching from Conventional to Drip irrigation
 
Achievements and Key Findings (Syria)
Achievements and Key Findings (Syria)Achievements and Key Findings (Syria)
Achievements and Key Findings (Syria)
 
Policies: Land Consolidation and Fragmentation
Policies: Land Consolidation and FragmentationPolicies: Land Consolidation and Fragmentation
Policies: Land Consolidation and Fragmentation
 
Value Chain Analysis
Value Chain AnalysisValue Chain Analysis
Value Chain Analysis
 
System Vulnerability Analysis
System Vulnerability AnalysisSystem Vulnerability Analysis
System Vulnerability Analysis
 
Geoinformatics Application: Integrated Agro-ecosystem Research in Jordan
Geoinformatics Application: Integrated Agro-ecosystem Research in JordanGeoinformatics Application: Integrated Agro-ecosystem Research in Jordan
Geoinformatics Application: Integrated Agro-ecosystem Research in Jordan
 
Data management, Analysis, and Sharing
Data management, Analysis, and SharingData management, Analysis, and Sharing
Data management, Analysis, and Sharing
 
Data Management: Geo-informatics
Data Management: Geo-informaticsData Management: Geo-informatics
Data Management: Geo-informatics
 
The Value of Data
The Value of DataThe Value of Data
The Value of Data
 
Overview: Information Management at ICARDA
Overview: Information Management at ICARDAOverview: Information Management at ICARDA
Overview: Information Management at ICARDA
 
Open Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality DataOpen Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality Data
 
Contextual relevance and implementation in Jordan
Contextual relevance and implementation in JordanContextual relevance and implementation in Jordan
Contextual relevance and implementation in Jordan
 
Yazbek crp ds - icarda - ncare technical meeting - 21-22 sept 2014
Yazbek  crp ds - icarda - ncare technical meeting - 21-22 sept 2014Yazbek  crp ds - icarda - ncare technical meeting - 21-22 sept 2014
Yazbek crp ds - icarda - ncare technical meeting - 21-22 sept 2014
 
Nadira and Dina final gender ds presentation ncare meeting and planning of 2...
Nadira and Dina final gender ds  presentation ncare meeting and planning of 2...Nadira and Dina final gender ds  presentation ncare meeting and planning of 2...
Nadira and Dina final gender ds presentation ncare meeting and planning of 2...
 
Improving livestock production in Jordan
Improving livestock production  in Jordan Improving livestock production  in Jordan
Improving livestock production in Jordan
 
Mudabber crp ds soil conservation
Mudabber  crp ds    soil conservationMudabber  crp ds    soil conservation
Mudabber crp ds soil conservation
 
Achievements managing agro-pastoral rangelands-final by Mounir Louhaichi
Achievements managing agro-pastoral rangelands-final by Mounir LouhaichiAchievements managing agro-pastoral rangelands-final by Mounir Louhaichi
Achievements managing agro-pastoral rangelands-final by Mounir Louhaichi
 
Jordan morocco sept by Hichem Bn SAlem2014
Jordan morocco sept by Hichem Bn SAlem2014Jordan morocco sept by Hichem Bn SAlem2014
Jordan morocco sept by Hichem Bn SAlem2014
 
Introducing and disseminating forage and grain based CA systems in Jordan
Introducing and disseminating forage and grain based CA systems in Jordan Introducing and disseminating forage and grain based CA systems in Jordan
Introducing and disseminating forage and grain based CA systems in Jordan
 

Último

Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作
Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作
Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作f3774p8b
 
Science, Technology and Nation Building.pptx
Science, Technology and Nation Building.pptxScience, Technology and Nation Building.pptx
Science, Technology and Nation Building.pptxgrandmarshall132
 
Asexual-and-Sexual-Reproduction.huhupptx
Asexual-and-Sexual-Reproduction.huhupptxAsexual-and-Sexual-Reproduction.huhupptx
Asexual-and-Sexual-Reproduction.huhupptxMyBrightestStarParkJ
 
EMP (Environment Management Plan . .pptx
EMP (Environment Management Plan . .pptxEMP (Environment Management Plan . .pptx
EMP (Environment Management Plan . .pptxSarmad Naeem
 
Limnology and Wetland Management 2023 NaRM.pptx
Limnology and Wetland Management 2023 NaRM.pptxLimnology and Wetland Management 2023 NaRM.pptx
Limnology and Wetland Management 2023 NaRM.pptxTesfahunTesema
 
毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree 毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree ttt fff
 
办理德州理工大学毕业证书TTU文凭学位证书
办理德州理工大学毕业证书TTU文凭学位证书办理德州理工大学毕业证书TTU文凭学位证书
办理德州理工大学毕业证书TTU文凭学位证书zdzoqco
 
885MTAMount DMU University Bachelor's Diploma in Education
885MTAMount DMU University Bachelor's Diploma in Education885MTAMount DMU University Bachelor's Diploma in Education
885MTAMount DMU University Bachelor's Diploma in Educationz xss
 
BIODIVERSITY QUIZ ELIMINATION ROUND.pptx
BIODIVERSITY QUIZ ELIMINATION ROUND.pptxBIODIVERSITY QUIZ ELIMINATION ROUND.pptx
BIODIVERSITY QUIZ ELIMINATION ROUND.pptxROLANARIBATO3
 
NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...
NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...
NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...Amil baba
 
Making a Difference: Understanding the Upcycling and Recycling Difference
Making a Difference: Understanding the Upcycling and Recycling DifferenceMaking a Difference: Understanding the Upcycling and Recycling Difference
Making a Difference: Understanding the Upcycling and Recycling DifferenceSwag Cycle
 
global trend Chapter 1.presentation power point
global trend Chapter 1.presentation power pointglobal trend Chapter 1.presentation power point
global trend Chapter 1.presentation power pointyohannisyohannis54
 
Along the Lakefront, "Menacing Unknown"s
Along the Lakefront, "Menacing Unknown"sAlong the Lakefront, "Menacing Unknown"s
Along the Lakefront, "Menacing Unknown"syalehistoricalreview
 
办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一
办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一
办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一z xss
 
Species composition, diversity and community structure of mangroves in Barang...
Species composition, diversity and community structure of mangroves in Barang...Species composition, diversity and community structure of mangroves in Barang...
Species composition, diversity and community structure of mangroves in Barang...Open Access Research Paper
 
INSIDER THREAT PREVENTION IN THE US BANKING SYSTEM
INSIDER THREAT PREVENTION IN THE US BANKING SYSTEMINSIDER THREAT PREVENTION IN THE US BANKING SYSTEM
INSIDER THREAT PREVENTION IN THE US BANKING SYSTEMijsc
 
5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdf
5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdf5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdf
5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdfsrivastavaakshat51
 

Último (20)

Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作
Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作
Düsseldorf U学位证,杜塞尔多夫大学毕业证书1:1制作
 
Science, Technology and Nation Building.pptx
Science, Technology and Nation Building.pptxScience, Technology and Nation Building.pptx
Science, Technology and Nation Building.pptx
 
Asexual-and-Sexual-Reproduction.huhupptx
Asexual-and-Sexual-Reproduction.huhupptxAsexual-and-Sexual-Reproduction.huhupptx
Asexual-and-Sexual-Reproduction.huhupptx
 
PLANTILLAS DE MEMORAMA CIENCIAS NATURALES
PLANTILLAS DE MEMORAMA CIENCIAS NATURALESPLANTILLAS DE MEMORAMA CIENCIAS NATURALES
PLANTILLAS DE MEMORAMA CIENCIAS NATURALES
 
EMP (Environment Management Plan . .pptx
EMP (Environment Management Plan . .pptxEMP (Environment Management Plan . .pptx
EMP (Environment Management Plan . .pptx
 
Limnology and Wetland Management 2023 NaRM.pptx
Limnology and Wetland Management 2023 NaRM.pptxLimnology and Wetland Management 2023 NaRM.pptx
Limnology and Wetland Management 2023 NaRM.pptx
 
毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree 毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree美国密苏里大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
办理德州理工大学毕业证书TTU文凭学位证书
办理德州理工大学毕业证书TTU文凭学位证书办理德州理工大学毕业证书TTU文凭学位证书
办理德州理工大学毕业证书TTU文凭学位证书
 
885MTAMount DMU University Bachelor's Diploma in Education
885MTAMount DMU University Bachelor's Diploma in Education885MTAMount DMU University Bachelor's Diploma in Education
885MTAMount DMU University Bachelor's Diploma in Education
 
young call girls in Janakpuri🔝 9953056974 🔝 escort Service
young call girls in Janakpuri🔝 9953056974 🔝 escort Serviceyoung call girls in Janakpuri🔝 9953056974 🔝 escort Service
young call girls in Janakpuri🔝 9953056974 🔝 escort Service
 
BIODIVERSITY QUIZ ELIMINATION ROUND.pptx
BIODIVERSITY QUIZ ELIMINATION ROUND.pptxBIODIVERSITY QUIZ ELIMINATION ROUND.pptx
BIODIVERSITY QUIZ ELIMINATION ROUND.pptx
 
NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...
NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...
NO1 Certified Rohani Amil In Islamabad Amil Baba in Rawalpindi Kala Jadu Amil...
 
Making a Difference: Understanding the Upcycling and Recycling Difference
Making a Difference: Understanding the Upcycling and Recycling DifferenceMaking a Difference: Understanding the Upcycling and Recycling Difference
Making a Difference: Understanding the Upcycling and Recycling Difference
 
global trend Chapter 1.presentation power point
global trend Chapter 1.presentation power pointglobal trend Chapter 1.presentation power point
global trend Chapter 1.presentation power point
 
Along the Lakefront, "Menacing Unknown"s
Along the Lakefront, "Menacing Unknown"sAlong the Lakefront, "Menacing Unknown"s
Along the Lakefront, "Menacing Unknown"s
 
办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一
办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一
办理(Victoria毕业证书)维多利亚大学毕业证成绩单原版一比一
 
Species composition, diversity and community structure of mangroves in Barang...
Species composition, diversity and community structure of mangroves in Barang...Species composition, diversity and community structure of mangroves in Barang...
Species composition, diversity and community structure of mangroves in Barang...
 
INSIDER THREAT PREVENTION IN THE US BANKING SYSTEM
INSIDER THREAT PREVENTION IN THE US BANKING SYSTEMINSIDER THREAT PREVENTION IN THE US BANKING SYSTEM
INSIDER THREAT PREVENTION IN THE US BANKING SYSTEM
 
5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdf
5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdf5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdf
5 Wondrous Places You Should Visit at Least Once in Your Lifetime (1).pdf
 
Biopesticide. pptx.
Biopesticide. pptx.Biopesticide. pptx.
Biopesticide. pptx.
 

Where do we currently stand at ICARDA?

  • 1. Data management for CRP DS research: where do we currently stand at ICARDA? • CO: CGIAR Open Access and Data Management Plans & Implementation (Article 4.1.9) states “Open Access and Data Management Plans should be prepared in order to ensure implementation of this Policy. Such Plans shall, in particular, outline a strategy for maximizing opportunities to make information products Open Access”. • Output: Research quality and data quality issues in CRP DS research and mechanism/workflow
  • 2. Data management for CRP DS research: where do we currently stand at ICARDA? Plan •Sources of data under CRP DS •Status of DM at ICARDA •CRP DS research areas for data generation •Issues and solutions related to Research quality and data quality •Workflow for DM sharing
  • 3. Scope: Sources of data Scope of work is determined by observing a complex interplay of •Base components: crops, livestock, rangelands, trees etc. & production systems × •Biophysical environment constraints: water scarcity, land degradation × •Technological access : Access to the product and regulatory environment
  • 4. Partners in the DM (OA) • Who generates the data? Who owns them? Who regulates their sharing? Outcome: What after archiving with an Open Assess (OA) System? • Data mining • Exploration of large or even BIG data leading to a wider picture viewed from the bridge • No dearth of random factors/sources in data • Availability of prior information • Bayesian analysis to span the statistical inference domain to reality
  • 5. CRP DS DM Current Status DM Status at ICARDA /Its Flagships Target Regions •ICARDA Projects: D and DM with scientists, archived in their laptops, different various locations/countries •GU data on Central servers, Amman, Jordan •D Manager to be recruited •NARS data with NARS
  • 6. ICARDA Projects dealing with CRP DS ICARDA Programs: DSIPS, IWLM, SEPR, BIGM (also generating data for other CRPs) 1. Cropping systems and Agronomy on-station and on-farm •On-station Trials – Single factor, multi-factors including: – systems of rotations, intercrop, monocrops – crop components – fertilizer input – IPDM controls and other management factors
  • 7. CRP DS DM - Research quality On-farm Trials: • Less frequent: research for technology generation – Experimental design with small number of treatments, small blocks, variable treatment designated as control or farmer-technology, relatively large number of replications •Most frequent: technology verification and demonstration •Sampling design: large plots, small number of sample is a concern
  • 8. ICARDA Projects dealing with CRP DS • A list of data sources for CRP DS and other ICARDA projects: • Design of crop rotation trials [general] • DM and Analyses of data from the 2-course long-term wheat rotations (productivity, sustainability aspects including time-trend estimation). [NAWA: Long-term crop rotation trials on wheat & Barley at Tel Hadya, Syria, Long-term wheat rotation trial at Kamishly, Syria, Long-term sustainability trials in Egypt, etc.] • Evaluation of conservation tillage data [CA trials in Jordan and Iraq] • Analyses of data from livestock evaluation experiments [Long-term trials, wheat & Barley at Tel Hadya]
  • 9. ICARDA Projects dealing with CRP DS ICARDA Outlook: •Decentralization of ICARDA has changed the way we do our business. •Archiving data and sharing has [essentially] become the way of our business. •We need to extend [quality] data sharing from within ICARDA to Public. NARS/ five Flagship target regions •1) The West African Sahel and dry savannas , 2) East and Southern Africa, 3) North Africa and West Asia, 4) Central Asia, 5) South Asia
  • 10. ICARDA Projects dealing with CRP DS: Key to DQ Research quality (RQ) •Experimental design could be an issue (in terms of blocking and replications) •Approach/Solution: thorough discussion with subject matter specialists and biometrician/statistician •Resources for enhancing RQ and DQ: • •JNR Jeffers (1978). Statistical Checklist: Design of Experiments No. 1 (Statistical checklists). Institute of Terrestrial Ecology, Natural Environment Research Council, Cambridge, UK. http://www.sawleystudios.co.uk/jnrj/StatisticalCheck/Design.htm) • •JNR Jeffers (1979). Sampling (Statistical Checklist 2). Institute of Terrestrial Ecology, Natural Environment Research Council, Cambridge, UK. http://www.sawleystudios.co.uk/jnrj/StatisticalCheck/Sampling.htm • •David J. Finney (1990). Statistical data-their care and maintenance. Indian Society of Agricultural Statistics. •“This bulletin is extremely useful for students and research workers … topics dealt with are: acquisition of data, design of data gathering , care for data, types and units of data analysis and databases, copying, statistical ethics, data-entry to the computer, data scrutiny, integrity and some illustrations.”
  • 11. ICARDA Projects dealing with CRP DS Examples of Data quality issues: •Experimental design accepted; crop management properly followed. • Experimental plots: plot size, harvested areas [2-row, 3-row, 4-row plots], calculation per hectare basis •Days to 50% flowering- how many plants were actually observed? •plant height (cm)- number of plants •seed yield, bio yield – area used; drying methods •Data entry? •Lack of Data recording electronic devices and transfer to file at laptop – Early days: field-books – Recent: Android Apps etc. – Data in Excel worksheet •What checks should we perform? •What should be the level of Experimental data quality for public sharing
  • 12. ICARDA Projects dealing with CRP DS 2. Crop Improvement CRPs (CRP Wheat, CRP DC, CRP GL) •Single factor- Crop varieties •Unreplicated designs for test materials + replicated or repeated checks •Replicated variety trials in RCB, IBD (alpha-designs), p-rep designs •METs (Multi-environments/Multi-location and multi-year trials) •Two-factor experiments – Crops + Crop varieties – Sometimes agronomic trials – planting dates, IPDMs etc. •Result outputs from Commodity CRPs, where breeding is the key component (CRP Wheat, CRP DC, CRP GL) flow to CRP DS. Where are the data? •Data with scientists in their laptops •Status in relation to DM(OA)/sharing is unknown to me
  • 13. ICARDA Projects dealing with CRP DS 3. Issues of the Poor Quality of Data- Indicators and resolves A frequent issue of data quality •Is really something wrong with my data? Some statistical procedures work and some others do not, BUT the data are the same. Regression, GLM works but ANOVA does not. What is wrong with Stats? •ANOVA may turn to be a great tool for data checking, -- missing values in data variables may be the reality. •How about missing or repeats by mistake in a factor levels or factorial combinations?
  • 14. ICARDA Projects dealing with CRP DS Some cases of data quality issues: •1. Research Quality– experimental design OK but Data on Design not OK/ design factors incorrectly entered; frequently encountered; Must be corrected before analysis else we have carried out a study different from what we planned and still think. – factor combinations not aligning with design (not missing observations) •2. Observed data values; traits values: errors of recording/data transfers to files – values out of range (a variable to lie within 0-100 or 0-1 goes outside; recording error) – Outliers/ recorded values appear too extreme. Will require validation with the assistant/scientists and if errors are found then must be corrected; generally viewed as the context of uni-variate analysis. – Outliers may have issues of interpretation and detection. Looks outlier in BY but not in log(BY) or sqrt(BY). There might be multivariate outliers. A column of remarks, possibly in the field book may support the recorded data.
  • 15. ICARDA Projects dealing with CRP DS Some cases of data quality issues… continued •3. Relationships between the traits appearing along the crop development cycle may also be identified and used to build in data quality • DAF << DMAT • GY << BY •4. Helpful: Electronic data loggers (balance, Android Apps, with GIS/Date) •5. Role of the scientist/ a data supervisor must be made effective— random checks on data recording in the field book as well as in the file. Observations should be validated by another researchers experienced in the same discipline, particularly with visual scores. Random checks could be more effective. Data errors could be linked to the observer.
  • 16. ICARDA Projects dealing with CRP DS Some checks and balances: •Data care bulletin (see References) Tools: •Design experiment/survey specific tools (Biometrician/Statistician to Data Manager). Clearly define the roles. •Examine factors combinations appearing in the data •Examine tables/cross-tables for qualitative data •Descriptive statistics • min, max, range, ratio=max/min (min>0) • Histograms •Box-plots and other diagnostics
  • 17. ICARDA Projects dealing with CRP DS Some checks and balances … continued. •No go with ANOVA may turn to be a good thing to check bad data. However, as in above, – Missing values in response/covariate variables are a reality – But missing a factor level or factor combinations appear due to data entry error; combinations being different from those in the design. – Cases of repeated units – data entry errors •Outliers, if detected via a model fitting should stay in the data. Of course data validation, where possible, is encouraged.
  • 18. ICARDA Projects dealing with CRP DS Some checks and balances...continued: •Benefiting from ICRISAT Tools and Techniques • on data checking tools • archiving the data on public platforms (an enforcer of Data Quality) • e.g. data systems from ICRISAT, Dataverse ( http://dvn.iq.harvard.edu/dvn/) •Computing tools/procedures: Training and development •Excel macros, Genstat/SAS/SPSS/R/other software •Database development/datasheet preparation/ archiving
  • 19. ICARDA Projects dealing with CRP DS An attractive specialization: •Data Science •The Data Scientist’s Toolbox: https://www.coursera.org/course/datascitoolbox
  • 20. ICARDA Projects dealing with CRP DS Crystalizing an approach: 4. CRP: Dryland Systems Management: Workflow components Home Center: Project/Meta data: Project ID, objectives, location, year, personnel (Planner, M&E team, data collector etc.), trial level information, factors (design and treatment), variables etc., A report of data validation in Step 2; links to data; •Data <<<< validation (via agreed tools) •Mechanism for Data Quality Check •1. Scientists >> >2. Statistician/DM team: apply the agreed tools • a) If fails-----> (1) to scientist for update • b) If passes----> Get metadata and links to data •2. Archiving (what? who will do this? DM Team?) • Sharing permissions etc. This could be a Workflow of permissions: Requester ---> Approval 1--->Approval 2 ---…---> Director CRP DS/nominee.
  • 21. ICARDA Projects dealing with CRP DS …..continued: •Information Management. This refers to the [statistically analysed] results files/publications generated. •Knowledge Management: Key findings, Implications, lessons learned NARS •Identify the active NARS partners •Training on the above tools and workflow, Share Policy and Procedure on CRP DS DM (OA) •Identify the risk factors and their indicators and develop an action plan with resources required •Measure and Monitor the impact