Uitgebreid zoeken
Boek

Best Practices In Data Cleaning

A Complete Guide To Everything You Need To Do Before And After Collecting Your Data

Best Practices In Data Cleaning - Osborne, Jason W. - ISBN: 9781412988018
Prijs: € 31,85
Levertijd: 3 tot 5 werkdagen
Bindwijze: Boek, Paperback
Genre: Sociologie algemeen
Add to cart

Beschrijving

This Book Provides A Clear, Step-by-step Process Of Examining And Cleaning Data In Order To Decrease Error Rates And Increase Both The Power And Replicability Of Results.

Details

Titel: Best Practices In Data Cleaning
Auteur: Osborne, Jason W.
Mediatype: Boek
Bindwijze: Paperback
Taal: Engels
Aantal pagina's: 296
Uitgever: Sage Publications Inc
Plaats van publicatie: 01
NUR: Sociologie algemeen
Afmetingen: 229 x 152 x 20
Gewicht: 340 gr
ISBN/ISBN13: 9781412988018
Intern nummer: 18134634

Inhoudsopgave

Preface xi
About the Author xv
Chapter 1 Why Data Cleaning Is Important: Debunking the Myth of Robustness
1(16)
Origins of Data Cleaning
2(3)
Are Things Really That Bad?
5(3)
Why Care About Testing Assumptions and Cleaning Data?
8(1)
How Can This State of Affairs Be True?
8(2)
The Best Practices Orientation of This Book
10(1)
Data Cleaning Is a Simple Process; However...
11(1)
One Path to Solving the Problem
12(1)
For Further Enrichment
13(4)
SECTION I BEST PRACTICES AS YOU PREPARE FOR DATA COLLECTION
17(68)
Chapter 2 Power and Planning for Data Collection: Debunking the Myth of Adequate Power
19(24)
Power and Best Practices in Statistical Analysis of Data
20(2)
How Null-Hypothesis Statistical Testing Relates to Power
22(1)
What Do Statistical Tests Tell Us?
23(3)
How Does Power Relate to Error Rates?
26(2)
Low Power and Type I Error Rates in a Literature
28(1)
How to Calculate Power
29(2)
The Effect of Power on the Replicability of Study Results
31(2)
Can Data Cleaning Fix These Sampling Problems?
33(1)
Conclusions
34(1)
For Further Enrichment
35(1)
Appendix
36(7)
Chapter 3 Being True to the Target Population: Debunking the Myth of Representativeness
43(28)
Sampling Theory and Generalizability
45(1)
Aggregation or Omission Errors
46(3)
Including Irrelevant Groups
49(3)
Nonresponse and Generalizability
52(2)
Consent Procedures and Sampling Bias
54(2)
Generalizability of Internet Surveys
56(2)
Restriction of Range
58(4)
Extreme Groups Analysis
62(3)
Conclusion
65(1)
For Further Enrichment
65(6)
Chapter 4 Using Large Data Sets With Probability Sampling Frameworks: Debunking the Myth of Equality
71(14)
What Types of Studies Use Complex Sampling?
72(1)
Why Does Complex Sampling Matter?
72(2)
Best Practices in Accounting for Complex Sampling
74(2)
Does It Really Make a Difference in the Results?
76(4)
So What Does All This Mean?
80(1)
For Further Enrichment
81(4)
SECTION II BEST PRACTICES IN DATA CLEANING AND SCREENING
85(126)
Chapter 5 Screening Your Data for Potential Problems: Debunking the Myth of Perfect Data
87(18)
The Language of Describing Distributions
90(3)
Testing Whether Your Data Are Normally Distributed
93(7)
Conclusions
100(1)
For Further Enrichment
101(1)
Appendix
101(4)
Chapter 6 Dealing With Missing or Incomplete Data: Debunking the Myth of Emptiness
105(34)
What Is Missing or Incomplete Data?
106(3)
Categories of Missingness
109(1)
What Do We Do With Missing Data?
110(7)
The Effects of Listwise Deletion
117(1)
The Detrimental Effects of Mean Substitution
118(4)
The Effects of Strong and Weak Imputation of Values
122(3)
Multiple Imputation: A Modern Method of Missing Data Estimation
125(3)
Missingness Can Be an Interesting Variable in and of Itself
128(2)
Summing Up: What Are Best Practices?
130(1)
For Further Enrichment
131(1)
Appendixes
132(7)
Chapter 7 Extreme and Influential Data Points: Debunking the Myth of Equality
139(30)
What Are Extreme Scores?
140(1)
How Extreme Values Affect Statistical Analyses
141(1)
What Causes Extreme Scores?
142(7)
Extreme Scores as a Potential Focus of Inquiry
149(3)
Identification of Extreme Scores
152(1)
Why Remove Extreme Scores?
153(3)
Effect of Extreme Scores on Inferential Statistics
156(1)
Effect of Extreme Scores on Correlations and Regression
156(5)
Effect of Extreme Scores on t-Tests and ANOVAs
161(4)
To Remove or Not to Remove?
165(1)
For Further Enrichment
165(4)
Chapter 8 Improving the Normality of Variables Through Box-Cox Transformation: Debunking the Myth of Distributional Irrelevance
169(22)
Why Do We Need Data Transformations?
171(1)
When a Variable Violates the Assumption of Normality
171(1)
Traditional Data Transformations for Improving Normality
172(4)
Application and Efficacy of Box-Cox Transformations
176(5)
Reversing Transformations
181(3)
Conclusion
184(1)
For Further Enrichment
185(1)
Appendix
185(6)
Chapter 9 Does Reliability Matter? Debunking the Myth of Perfect Measurement
191(20)
What Is a Reasonable Level of Reliability?
192(1)
Reliability and Simple Correlation or Regression
193(2)
Reliability and Partial Correlations
195(2)
Reliability and Multiple Regression
197(1)
Reliability and Interactions in Multiple Regression
198(1)
Protecting Against Overcorrecting During Disattenuation
199(1)
Other Solutions to the Issue of Measurement Error
200(1)
What If We Had Error-Free Measurement?
200(2)
An Example From My Research
202(3)
Does Reliability Influence Other Analyses?
205(1)
The Argument That Poor Reliability Is Not That Important
206(1)
Conclusions and Best Practices
207(1)
For Further Enrichment
208(3)
SECTION III ADVANCED TOPICS IN DATA CLEANING
211(54)
Chapter 10 Random Responding, Motivated Misresponding, and Response Sets: Debunking the Myth of the Motivated Participant
213(18)
What Is a Response Set?
213(1)
Common Types of Response Sets
214(2)
Is Random Responding Truly Random?
216(1)
Detecting Random Responding in Your Research
217(2)
Does Random Responding Cause Serious Problems With Research?
219(1)
Example of the Effects of Random Responding
219(5)
Are Random Responders Truly Random Responders?
224(1)
Summary
224(1)
Best Practices Regarding Random Responding
225(1)
Magnitude of the Problem
226(1)
For Further Enrichment
226(5)
Chapter 11 Why Dichotomizing Continuous Variables Is Rarely a Good Practice: Debunking the Myth of Categorization
231(22)
What Is Dichotomization and Why Does It Exist?
233(1)
How Widespread Is This Practice?
234(2)
Why Do Researchers Use Dichotomization?
236(1)
Are Analyses With Dichotomous Variables Easier to Interpret?
236(1)
Are Analyses With Dichotomous Variables Easier to Compute?
237(1)
Are Dichotomous Variables More Reliable?
238(8)
Other Drawbacks of Dichotomization
246(4)
For Further Enrichment
250(3)
Chapter 12 The Special Challenge of Cleaning Repeated Measures Data: Lots of Pits in Which to Fall
253(8)
Treat All Time Points Equally
253(4)
What to Do With Extreme Scores?
257(1)
Missing Data
258(1)
Summary
258(3)
Chapter 13 Now That the Myths Are Debunked...: Visions of Rational Quantitative Methodology for the 21st Century
261(4)
Name Index 265(4)
Subject Index 269

Winkelvoorraad

Dit product is op dit moment niet op voorraad in een van onze vestigingen.