SlideShare uma empresa Scribd logo
1 de 14
Baixar para ler offline
The Distribution of References
in Scientific Papers:
an Analysis of the IMRaD Structure
ISSI 2013
Vienna, 16 July 2013
Marc Bertin, Iana Atanassova, Vincent Lariviere, Yves Gingras
Problem
Scientific papers usually follow a specific
rhetorical structure: the IMRaD structure
(Introduction, Method, Result and Discussion).
Questions:Questions:
What relationships exist between cited
references and the structure of the text?
How does the IMRaD structure affect the
distribution of references in scientific
papers?
Method
Corpus: 7 peer-reviewed academic journals:
PLoS series (ONE, Biology, Computational Biology,
Genetics, Medicine, Neglected Tropical Diseases,
Pathogens)
XML using Journal Article Tag Suite (JATS)XML using Journal Article Tag Suite (JATS)
More than 47,000 scientific articles
Identify the section structure of the articles
Identify cited references in the text
Study the distribution of references according
to the text progression and structure.
Sections Identification
• Section titles can vary according to the
article.
• e.g. "Method", "Methods", "Method and
Model"Model"
• Section titles were analyzed in order to
match each section with one of the
section types in the IMRaD structure.
Sentence Level Processing
We use sentences as basic units to model
text progression
Sentence segmentation allows us to work
with text elements that are smaller than
paragraphsparagraphs
Analysis of the punctuation of the text
following a set of typographic rules
For each sentence, we count the number of
references it contains and obtain their
distribution along the text.
Corpus
Cited References
Cited references are present as separate
elements in the XML structure
Special cases needing specific processing:
reference ranges
ResultsResults
PLoS ONE &
PLoS Computational Biology
PloS Genetics, PLoS
Pathogens & PLoS Biology
PLoS Medicine & PLoS
Neglected Tropical Diseases
IMRaD Structure
Conclusion
We have obtained the distribution of
cited references in scientific papers.
We have shown that this distribution
seems quite stable and maybe evenseems quite stable and maybe even
invariant if we take into account the
changes that occur in some journals in
the positions of the different sections in
the text of the articles.
Thank you!Thank you!

Mais conteúdo relacionado

Semelhante a The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

briefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docxbriefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docx
sdfghj21
 
DirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docxDirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docx
kimberly691
 

Semelhante a The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013 (20)

Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
 
2015.ESP
2015.ESP2015.ESP
2015.ESP
 
Analysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
Analysing Author Name Mentions In Citation Contexts Of Highly Cited PublicationsAnalysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
Analysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
 
briefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docxbriefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docx
 
A Citation Centric Annotation Scheme For Scientific Articles
A Citation Centric Annotation Scheme For Scientific ArticlesA Citation Centric Annotation Scheme For Scientific Articles
A Citation Centric Annotation Scheme For Scientific Articles
 
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
 
APA manual PPT 2
APA manual PPT 2APA manual PPT 2
APA manual PPT 2
 
Cocitation Networks and Random Walk
Cocitation Networks and Random WalkCocitation Networks and Random Walk
Cocitation Networks and Random Walk
 
Literature Review Matrix
Literature Review MatrixLiterature Review Matrix
Literature Review Matrix
 
CHEM281 2012
CHEM281 2012CHEM281 2012
CHEM281 2012
 
Ibn Sina
Ibn SinaIbn Sina
Ibn Sina
 
Data Mining in Rediology reports
Data Mining in Rediology reportsData Mining in Rediology reports
Data Mining in Rediology reports
 
A knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systemsA knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systems
 
DirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docxDirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docx
 
3. learning from other and reviewing the literature
3. learning from other and reviewing the literature3. learning from other and reviewing the literature
3. learning from other and reviewing the literature
 
Semantic Integration for Heterogeneous Domain-specific Information: The NIF Case
Semantic Integration for Heterogeneous Domain-specific Information: The NIF CaseSemantic Integration for Heterogeneous Domain-specific Information: The NIF Case
Semantic Integration for Heterogeneous Domain-specific Information: The NIF Case
 
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
 
Marcu 2000 presentation
Marcu 2000 presentationMarcu 2000 presentation
Marcu 2000 presentation
 
7 calais
7 calais7 calais
7 calais
 
7 calais
7 calais7 calais
7 calais
 

Último

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
Bhagirath Gogikar
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 

Último (20)

Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Unit5-Cloud.pptx for lpu course cse121 o
Unit5-Cloud.pptx for lpu course cse121 oUnit5-Cloud.pptx for lpu course cse121 o
Unit5-Cloud.pptx for lpu course cse121 o
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 

The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

  • 1. The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure ISSI 2013 Vienna, 16 July 2013 Marc Bertin, Iana Atanassova, Vincent Lariviere, Yves Gingras
  • 2. Problem Scientific papers usually follow a specific rhetorical structure: the IMRaD structure (Introduction, Method, Result and Discussion). Questions:Questions: What relationships exist between cited references and the structure of the text? How does the IMRaD structure affect the distribution of references in scientific papers?
  • 3. Method Corpus: 7 peer-reviewed academic journals: PLoS series (ONE, Biology, Computational Biology, Genetics, Medicine, Neglected Tropical Diseases, Pathogens) XML using Journal Article Tag Suite (JATS)XML using Journal Article Tag Suite (JATS) More than 47,000 scientific articles Identify the section structure of the articles Identify cited references in the text Study the distribution of references according to the text progression and structure.
  • 4. Sections Identification • Section titles can vary according to the article. • e.g. "Method", "Methods", "Method and Model"Model" • Section titles were analyzed in order to match each section with one of the section types in the IMRaD structure.
  • 5. Sentence Level Processing We use sentences as basic units to model text progression Sentence segmentation allows us to work with text elements that are smaller than paragraphsparagraphs Analysis of the punctuation of the text following a set of typographic rules For each sentence, we count the number of references it contains and obtain their distribution along the text.
  • 7. Cited References Cited references are present as separate elements in the XML structure Special cases needing specific processing: reference ranges
  • 9. PLoS ONE & PLoS Computational Biology
  • 11. PLoS Medicine & PLoS Neglected Tropical Diseases
  • 13. Conclusion We have obtained the distribution of cited references in scientific papers. We have shown that this distribution seems quite stable and maybe evenseems quite stable and maybe even invariant if we take into account the changes that occur in some journals in the positions of the different sections in the text of the articles.