INDIA HUMAN
DEVELOPMENT SURVEY
(IHDS)
TRAINING PROGRAM
MARCH 16, 2016
How to merge two rounds?
Merging Household Files
Relationship between IHDS-I and IHDS-II households
IHDS-I sample (N=41,554)
Replacement households in IHDS-II (N=2,134)
Split households from round 1 (N=5,397)
Reinterview households (N=34,621)
Attrition (N=6,911)
• Most important concept in merging two data files
1. Some households in round 1 have no match in round 2, and vice versa
2. Some households in round 1 match more than one household in round 2
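Both points can be seen on a toy example. The sketch below uses made-up IDs and variable names (hhid2005, hhid2012, income2005), not IHDS data: round 1 household 1 splits into two round 2 households, household 2 is lost to attrition, and round 2 household 40 is a replacement with no round 1 ID.

```stata
* Toy round 2 file (hypothetical values)
clear
input hhid2005 hhid2012
1 10
1 11
3 30
. 40
end
tempfile r2
save `r2'

* Toy round 1 file (hypothetical values)
clear
input hhid2005 income2005
1 500
2 300
3 800
end

* 1:m because one round 1 household may match several round 2 households
merge 1:m hhid2005 using `r2'
tabulate _merge
list, sepby(hhid2005)
```

In the result, _merge==3 marks households found in both rounds, _merge==1 the round 1 household with no match, and _merge==2 the round 2 household with no match.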
Any questions?
• Who was chosen for reinterview?
• What does a recontact rate of 83% mean?
• How were replacement households chosen?
• What is a split household?
What is needed to merge household files?
1. Round 1 household file (N=41,554)
2. Round 2 household file (N=42,152)
• (Why are there more cases in round 2?)
3. Linking file (N=42,152): gives the round 1 identification codes for every round 2 household that was reinterviewed; the linking codes are missing for the 2,134 households that are new
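Before merging, it may be worth checking that the linking file behaves as described. The sketch below is a toy stand-in with made-up values; it omits STATEID, DISTID and PSUID for brevity, which the real IHDS keys include. It checks that the round 2 IDs identify each row, counts households with no round 1 ID, and reports round 1 IDs that occur more than once (split households).

```stata
* Toy linking file (hypothetical values)
clear
input HHID HHSPLITID HHID2005 HHSPLITID2005
10 1 1 1
10 2 1 1
20 1 . .
end

* isid stops with an error if the key does not uniquely identify rows
isid HHID HHSPLITID

* rows without a round 1 ID are new (replacement) households
count if missing(HHID2005)
display "new households without a round 1 ID: " r(N)

* round 1 IDs that appear more than once belong to split households
duplicates report HHID2005 HHSPLITID2005 if !missing(HHID2005)
```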
Step 1: Link round 2 data to the linking file to get the round 1 ID
    use linkhh, clear
    sort STATEID DISTID PSUID HHID HHSPLITID
    merge 1:1 STATEID DISTID PSUID HHID HHSPLITID using round2HH, gen(_mergeR2link)
    sort STATEID DISTID PSUID HHID2005 HHSPLITID2005
    save round2HH_plus, replace
Step 2: Merge this Round 2+ file with the Round 1 file
    use round1HH
    rename HHID HHID2005
    rename HHSPLITID HHSPLITID2005
    sort STATEID DISTID PSUID HHID2005 HHSPLITID2005
    merge 1:m STATEID DISTID PSUID HHID2005 HHSPLITID2005 using round2HH_plus, gen(_mergeR1R2)
    sort STATEID DISTID PSUID HHID HHSPLITID
    save mergedHHR1R2, replace
The merged file is a superset of both rounds
• Households surveyed in both rounds: N=40,018
• Households surveyed in round 1 only (attrition): N=6,911
• Households surveyed in round 2 only (replacement): N=2,134
• Total: N=49,063
• Keep only _mergeR1R2==3 for panel analysis (N=40,018)
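A minimal sketch of that last step, on toy data with made-up IDs rather than the real merged file. It also counts how many round 2 households descend from each round 1 household, since split households make a round 1 record appear more than once in the panel; the variable n_round2 is a hypothetical name introduced here.

```stata
* Toy merged file (hypothetical values)
clear
input hhid2005 hhid2012 _mergeR1R2
1 10 3
1 11 3
2 .  1
3 30 3
. 40 2
end

tabulate _mergeR1R2

* keep only households observed in both rounds
keep if _mergeR1R2 == 3

* number of round 2 households descended from each round 1 household
bysort hhid2005: generate n_round2 = _N
list, sepby(hhid2005)
```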
Merging Individual Files
Relationship between IHDS-I and IHDS-II individuals
IHDS-I sample (N=215,754)
New individuals, new HH (N=9,760)
New individuals in round 1 HH (N=43,822)
Reinterviewed individuals (N=150,995)
HH attrition (N=29,299)
Individual attrition in reinterviewed HH (N=35,464)
• Most important concept in merging two data files
1. Even reinterviewed households have new members (births, marriages)
2. Even reinterviewed households have some members who are no longer there (deaths, marriages, migration)
What is needed to merge individual files?
1. Round 1 individual file (N=215,754)
2. Round 2 individual file (N=204,568)
• (Why are there fewer cases in round 2?)
3. Linking file (N=204,568): gives the round 1 identification codes for every round 2 individual who was reinterviewed; the linking codes are missing for individuals who are new in round 2, including everyone in the 2,134 new households
Step 1: Link round 2 data to the linking file to get the round 1 ID
    use linkind, clear
    sort STATEID DISTID PSUID HHID HHSPLITID PERSONID
    merge 1:1 STATEID DISTID PSUID HHID HHSPLITID PERSONID using round2IND, gen(_mergeR2link)
    sort STATEID DISTID PSUID HHID2005 HHSPLITID2005 PERSONID2005
    save round2IND_plus, replace
Step 2: Merge this Round 2+ file with the Round 1 file
    use round1IND
    rename HHID HHID2005
    rename HHSPLITID HHSPLITID2005
    rename PERSONID PERSONID2005
    sort STATEID DISTID PSUID HHID2005 HHSPLITID2005 PERSONID2005
    merge 1:m STATEID DISTID PSUID HHID2005 HHSPLITID2005 PERSONID2005 using round2IND_plus, gen(_mergeR1R2)
    sort STATEID DISTID PSUID HHID HHSPLITID PERSONID
    save mergedINDR1R2, replace
The merged file is a superset of both rounds
• Individuals surveyed in both rounds: N=150,988
• Individuals surveyed in round 1 only (attrition/death/migration): N=64,766
• Individuals surveyed in round 2 only (replacement/new): N=53,580
• Total: N=269,334
• Keep only _mergeR1R2==3 for panel analysis (N=150,988)
Ever-married woman file linkage
Same process as individual file linkage
• One thing to note: there was no ever-married woman file for 2004-5, so you will be merging with the household file from 2004-5
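Merging Caution
Merging does not keep both copies of a variable
• When a variable name appears in both files, merge keeps only one copy (by default the values from the file in memory), so the other round's values are lost
• To keep variables from round 1 and round 2 separate, rename all round 1 variables before merging
• Typically we use the commands
    rename * x*
    rename xSTATEID STATEID
  and so on for the other variables used in merging
• So xr05 will be age in 2004-5 and r05 will be age in 2011-12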
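A small sketch of the renaming step on a toy round 1 file with made-up values. It assumes Stata 12 or later for the grouped rename syntax, and it assumes the household keys should end up with the names used in Step 2 (HHID2005, HHSPLITID2005).

```stata
* Toy round 1 file (hypothetical values)
clear
input STATEID DISTID PSUID HHID HHSPLITID r05
1 1 1 10 1 34
1 1 1 11 1 52
end

* prefix every variable with x
rename * x*

* restore the names of the keys used for merging
rename (xSTATEID xDISTID xPSUID) (STATEID DISTID PSUID)
rename (xHHID xHHSPLITID) (HHID2005 HHSPLITID2005)

describe
```

After this, the only x-prefixed variable left in the toy file is xr05, so a round 2 variable named r05 can be merged in without a name clash.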
