HCUP Methods Series Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later

HCUP Methods Series

HCUP Methods Series Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later

Report #2015-09

December 10, 2014

Contact Information:

Healthcare Cost and Utilization Project (HCUP)
Agency for Healthcare Research and Quality
5600 Fishers Lane
Room 07W17B
Mail Stop Number 7W25B
Rockville, MD 20857
http://www.hcup-us.ahrq.gov

For Technical Assistance with HCUP Products:

Email: hcup@ahrq.gov

or

Phone: 1-866-290-HCUP

Recommended Citation: Houchens R, Ross D, Elixhauser A. Final Report on Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later. 2015. HCUP Methods Series Report # 2015-09 ONLINE. December 14, 2015. U.S. Agency for Healthcare Research and Quality. Available:
http://www.hcup-us.ahrq.gov/reports/methods/methods.jsp

TABLE OF CONTENTS

Skip Table of Contents

PREFACE
EXECUTIVE SUMMARY
INTRODUCTION
- NIS Sample Design
- NIS Sample Weights
MISSING VALUES
RATIONALE AND FORMULAS FOR NIS VARIANCE CALCULATIONS
EXAMPLES OF NIS VARIANCE CALCULATIONS
DISCUSSION
- Alternative Concepts of Variance
- Estimation Techniques
CONCLUSIONS
APPENDIX A: CODE FOR ANALYZING SUBPOPULATIONS USING SAS AND STATA
- SAS Programming Statements
- Stata Programming Statements

LIST OF TABLES

Table 1. SAS Output for DIABETES = 1, Domain Statistics
Table 2. Stata Output for Diabetes = 1
Table 3. Comparison of SAS and Stata Results for Complicated Diabetes, National Inpatient Sample (NIS), 2012

PREFACE

This version of the National Inpatient Sample (NIS) variance report applies to the latest NIS sample design, which is effective for data years 2012 and later. For data years 2011 and earlier, users should consult the previous version of the NIS variance report. ¹

EXECUTIVE SUMMARY

The Healthcare Cost and Utilization Project (HCUP) is a Federal-State-Industry partnership to build a standardized, multistate health data system. HCUP databases contain encounter-level healthcare data submitted by participating States. One HCUP database, the National Inpatient Sample (NIS), is the largest all-payer inpatient care database in the United States.

Beginning with the 2012 data year, the NIS is a stratified sample of discharges from all hospitals in the sampling frame defined by States that make their data available to the HCUP project and that can be matched to data from the American Hospital Association (AHA) Annual Survey of Hospitals. Hospitals are stratified by region, location and teaching status, bed size category, and ownership. Prior to the 2012 NIS, the samples included all discharges from a sample of hospitals in the sampling frame. For data prior to 2012, see the document titled Calculating Nationwide Inpatient Sample Variances, Data Year 2011 and Earlier.²

This document describes how to calculate simple statistics, including variances, from the NIS while taking into account the sampling design and sample discharge weights. Data from the 2012 NIS are used in all examples in this report, although the same methods can be applied to all subsequent data years. This report contains the program code required to calculate sample totals, means, rates, and their variances with two commonly used statistical programming languages that run on personal computers: SAS (SAS Institute Inc) and Stata (StataCorp LP). This report also provides results of example calculations from both statistical packages and demonstrates that the results are virtually the same for both statistical packages.

Two approaches to calculating variances for subpopulations are suggested. The first, described in the body of the report, uses the entire NIS sample. The second, described in Appendix A, uses only the subsample of the NIS corresponding to the subpopulation of interest. Finally, we discuss alternative concepts of variance and other methods that could be applied to calculating variances.

Variable	Label	Mean	Std Error of Mean	Sum	Std Dev
DISCHGS		1.000000	0	528,030	5,451.714493
LOS	Length of stay (cleaned)	4.601903	0.025091	2,429,116	29,480
DIED	Died during hospitalization	0.005531	0.000231	2919.999322	124.540093
TOTCHG	Total charges (cleaned)	33,914	422.923358	17,532,625,874	290,339,323

Output Measure	Survey Total Estimation, Discharges	Survey Mean Estimation, Length of Stay	Survey Mean Estimation, Total Charges	Survey Ratio Estimation, Died/Dischgs
No. of strata†	195	195	195	195
No. of PSUs	4,375	4,375	4,375	4,375
No. of observations*	7,296,566	7,296,530	7,294,355	7,296,544
Population size*	36,482,837	36,482,657	36,471,782	36,482,727
No. of observations, subpopulation*	105,606	105,570	103,395	105,584
Subpopulation size*	528,030.28	527,850.28	516,975.24	527,920.28
Design degrees of freedom	4,180	4,180	4,180	4,180
Linearized
Total or Mean	528,030.3	4.601903	33,913.86	.0055311
Standard error	5,451.715	.0250909	422.9234	.0002311
95% confidence interval	517,342-538,718.5	4.552712-4.651095	33,084.71-34,743.01	.0050781-.0059842

Variable	SAS		Stata
Variable	No.	Standard Error	No.	Standard Error
Total discharges	528,030	5,452	528,030	5,452
In-hospital mortality, %	.553	.0231	.553	.0231
Mean length of stay, days	4.60	.025	4.60	.025
Mean total charges, $	33,914	423	33,914	423

User Support

HCUP Methods Series HCUP Methods Series Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later Report #2015-09 December 10, 2014

HCUP Methods Series

HCUP Methods Series Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later

Report #2015-09

December 10, 2014