HEALTHCARE COST & UTILIZATION PROJECT

User Support

Do Your own analysis
Explore Expert Research & Limited Datasets

Producing National HCUP Estimates - Accessible Version


Producing National HCUP Estimates


Welcome

Thank you for joining us for this Healthcare Cost and Utilization Project (HCUP) online tutorial on producing national and regional estimates. This tutorial was created for researchers who are using HCUP nationwide databases, understand the design of the nationwide databases, and are ready to produce national and regional estimates.

In this tutorial you will learn how to produce national and regional estimates by weighting the unweighted HCUP data.


Contents:
  1. Introduction
    1. About HCUP
    2. Learning Objectives
    3. Weighting HCUP Data
  2. NIS
    1. National (Nationwide) Inpatient Sample (NIS)
    2. NIS Redesign
    3. Prior to the 2012 NIS
    4. NIS Weights
    5. About the Demonstrations
    6. NIS Unweighted Discharge Record Count
    7. NIS National Discharge-Level Estimates
    8. NIS Regional Discharge-Level Estimates
    9. NIS Discharge Weights Over Time
    10. NIS Hospital Weights
    11. Data Elements for NIS Hospital Weights Over Time
    12. NIS Unweighted Analysis
    13. NIS Summary
  3. NEDS
    1. Nationwide Emergency Department Sample (NEDS)
    2. NEDS Discharge Weights
    3. NEDS Unweighted Discharge Record Count
    4. NEDS National Discharge-Level Estimates
    5. NEDS Regional Discharge-Level Estimates
    6. NEDS Hospital Weights
    7. NEDS Unweighted Hospital ED Record Count
    8. NEDS National Hospital ED-Level Estimate
    9. NEDS Unweighted Analysis
    10. NEDS Summary
  4. KID
    1. Kids' Inpatient Database (KID)
    2. KID Discharge Weights
    3. KID Unweighted Discharge Record Count
    4. KID National Discharge-Level Estimates
    5. KID Regional Discharge-Level Estimates
    6. KID Weights Over Time
    7. KID Limitations
    8. KID Summary
  5. Wrap-Up
    1. Key Points
    2. Resources and Other Training

About HCUP

Before we get started, a quick word about HCUP:

HCUP is sponsored by the Agency for Healthcare Research and Quality (AHRQ). HCUP is a family of databases, software tools, and related research products that enable research on a variety of healthcare topics.

If you are unfamiliar with HCUP or would like a refresher, please consider taking our General Overview Course.

Return to Contents


Learning Objectives

The first objective is to understand how three nationwide databases - the NIS (National Inpatient Sample); the NEDS (Nationwide Emergency Department Sample); and the KID (Kids' Inpatient Database) can be weighted to produce national and regional estimates.

A separate tutorial for the nationwide database, the NRD, or Nationwide Readmissions Database, explains how to apply weights when using the NRD database.

The second objective is to select and apply the appropriate discharge or hospital weight to generate national estimates at the discharge or hospital level from unweighted record counts.

The third objective is to understand when it is appropriate to use the NIS and NEDS databases as unweighted samples. This module introduces weighting each of the three nationwide HCUP databases.

Please note: The Nationwide Readmissions Database (NRD), has its own tutorial that explains how to apply weights to produce national and regional estimates when using that database. Please refer to the Nationwide Readmissions Database (NRD) tutorial on the HCUP-US website. Return to Contents


Weighting HCUP Data

Why do we need to weight HCUP data?

Most researchers working with the HCUP nationwide databases are interested in using the data to create national and regional estimates.

The HCUP nationwide databases are samples designed to represent a larger universe - data must be weighted in order to achieve national and regional estimates.

For an in-depth explanation of the sample designs, you can access the HCUP online course on Sample Design of Nationwide Databases. The next sections will cover how each nationwide database can be used to produce national and regional estimates.

Return to Contents


National Inpatient Sample (NIS)

The National Inpatient Sample (NIS) is a database of hospital inpatient discharges which can be used to create national and regional estimates of hospital utilization, access, costs and quality.

To perform such analyses on the NIS data contained in the Core File, you must weight the unweighted observations.

Weighting the data will enable you to produce nationally representative estimates.

Return to Contents


NIS Redesign

In 2012, the NIS was redesigned to improve national estimates.

The previous NIS was composed of all discharges from a sample of hospitals in HCUP.

The redesigned NIS is sample of discharges from all hospitals in HCUP.

To highlight the design change, AHRQ renamed the Nationwide Inpatient Sample (NIS) to the National Inpatient Sample (NIS).

For detailed information on the 2012 NIS Redesign, see the NIS Redesign Report.

Return to Contents


Prior to the 2012 NIS

There are several changes to the NIS beginning with 2012 data.

The new NIS, beginning with the 2012 data year, is called the "National" Inpatient Sample. The previous NIS, which included data years 1988 to 2011, was called the "Nationwide" Inpatient Sample.

Whereas, the previous NIS universe included long-term acute care hospitals, and annual discharge estimates and hospital identification was were based on information from the American Hospital Association or AHA, the new NIS removes long-term acute care hospitals. Annual discharge estimates and hospital identification are based on information from the SID when available. Otherwise, they are based on AHA information.

Whereas the strata for the previous NIS used hospital census regions for stratification, the new NIS uses hospital census divisions for stratification.

Whereas the sample design for the previous NIS was to sample 1,000 hospitals, amounting to more than 8 million records, the sample design for the new NIS is to sample 7 million discharge records from more than 4,000 hospitals.

This new sampling strategy results in estimates with more precise statistical properties than the previous NIS design. This table has a summary of the 2012 NIS redesign.

  New NIS (beginning in 2012 Data Year) Previous NIS (1998-2011 Data Years)
Name National Inpatient Sample (NIS) Nationwide Inpatient Sample (NIS)
Universe Removed long-term acute care hospitals Included long-term acute care hospitals
Annual hospital discharge count estimates and hospital entities based on information from the SID when available, otherwise, based on AHA information Annual hospital discharge count estimates and hospital identification based on information from AHA
Strata Used hospital census division (9) for stratification Used hospital census region (4) for stratification
Sample Design 7 million hospital discharge records from more than 4,000 hospitals 8 million records from more than 1,000 hospitals




NIS Weights

The weights you apply to the data depend on the type of estimates you want to produce.

The NIS includes weights to produce national or regional estimates. NIS data for years prior to 2012 includes both hospital and discharge weights. The hospital weights can be used to produce hospital-level estimates, and the discharge weights can be used to produce discharge-level estimates.

Beginning with data from 2012 after the NIS redesign, hospital weights are not included in the data because they are no longer needed.

NIS data for years 2012 onwards, should only be weighted to produce discharge-level estimates.

To accurately produce discharge-level estimates, such as estimates of the total number of discharges with a diagnosis of asthma in the United States or estimates of the total number of discharges in the United States for individuals age 65 and over, you must apply a discharge weight to each record in the NIS Core File.

The discharge weights were calculated for NIS data by first stratifying the NIS hospitals on the same variables that were used for creating the sample.

These variables were Census division, urban/rural location, teaching status, bed size, and ownership.

A weight was then calculated for each stratum, by dividing the number of universe discharges in that stratum, obtained from HCUP and the AHA data, by the number of NIS discharges in the stratum. Weighted estimates can be calculated by applying the discharge weights to the sample discharges.

Weights have been assigned to each discharge and are stored in each record in the data element discharge weight (DISCWT).

When the discharge weights are applied to the unweighted NIS data, the result is an estimate of the number of discharges for the entire universe.

In the case of the NIS, the universe is all inpatient discharges from community hospitals in the U.S., excluding rehabilitation hospitals beginning with 1998, and excluding long-term acute-care hospitals beginning with 2012.

Return to Contents


About the Demonstrations

This tutorial will use SAS® to demonstrate how to weight HCUP data to produce national and regional estimates. In addition to SAS®, there are several other statistical software packages which can produce statistics from the stratified sampling design of the nationwide HCUP databases. STATA® and SPSS® are two commonly used examples.

For a more detailed explanation of how to use these software packages to work with the nationwide HCUP databases, please refer to the documentation available on Methods Report on HCUP-US, including the Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later.

During all demonstrations, this tutorial will refer to CCS categories. Clinical Classification Software utilizes a categorization scheme that collapses the universe of ICD-9-CM diagnosis codes into over 280 clinically meaningful diagnosis categories. It does the same for procedure codes. The CCS categorization scheme has been applied to the records within the HCUP databases, and the CCS codes are stored in each record.

Return to Contents


NIS Unweighted Discharge Record Count

As a means of demonstration in this tutorial, we will tabulate the unweighted number of records in the NIS for which asthma is indicated as a principal diagnosis. First determine which records do and do not have asthma listed as the principal diagnosis.


Title1 "Count Records with DXCCS1=128 (Asthma) from 2014 NIS"; 
Libname NIS2014	"C:\NIS\2014\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

data asthma;
set NIS2014.nis_2014_core 
    (keep=KEY_NIS HOSP_NIS DISCWT NIS_STRATUM DXCCS1);
    Attrib DXCCS1 Length=3 Label='Asthma DXCCS1=128';
    if dxccs1 eq 128 then asthma = 1;
    else asthma = 0;
run;

PROC SURVEYMEANS DATA=asthma SUM STD MEAN STDERR ;
    VAR Asthma;
    CLUSTER HOSP_NIS ;
    STRATA NIS_STRATUM ;
run;
          
  1. The CCS category for asthma includes 14 ICD-9 codes, so any record with one of those 14 codes listed as the principal diagnosis is assigned the CCS code 128 for asthma. Note that asthma is CCS code 128, so look for records in which DXCCS1 equals 128.
  2. 
    data asthma;
    set NIS2014.nis_2014_core 
        (keep=KEY_NIS HOSP_NIS DISCWT NIS_STRATUM DXCCS1);
        Attrib DXCCS1 Length=3 Label='Asthma DXCCS1=128';
        if dxccs1 eq 128 then asthma = 1;
        else asthma = 0;
    run;
     
  3. Use PROC SURVEYMEANS to generate statistics about the records which do and do not have the CCS code of 128 (asthma listed as a principal diagnosis). The SURVEYMEANS statement accounts for the complex sample design of the NIS.
  4. 
    PROC SURVEYMEANS DATA=asthma SUM STD MEAN STDERR ;
        VAR Asthma;
        CLUSTER HOSP_NIS ;
        STRATA NIS_STRATUM ;
    run;
    
  5. The resulting output will contain a data summary of the number of strata, clusters, and total observations in the data. In this example, the summary confirms a database composed of 206 sample strata containing 4,411 clusters - each cluster representing a single hospital - and 7,071,762 records - the number of records in the 2014 NIS. Although the examples employ the 2014 NIS, the program code is suitable for all NIS years except that the data element HOSP_NIS is replaced by HOSPID for 2011 and earlier.

  6. 
              Count Records with DXCCS1=128 (Asthma) from 2014 NIS File         
                                                                            
                               The SURVEYMEANS Procedure                        
                                                                            
                                     Data Summary                              
                                                                            
    				Number of Strata                 206                  
    				Number of Clusters              4411                  
    				Number of Observations       7071762                  
                                                                            
                                                                            
                                 Statistics                               
                                                                            
                                   Std Error                       Std Error
    Variable            Mean         of Mean             Sum          of Sum
    ------------------------------------------------------------------------
    Asthma          0.009613        0.000177           67978     1300.693790
     
  7. The statistics section provides the results of the analysis. The output in this example confirms that the number of records in the NIS with a principal diagnosis of asthma is 67,978. Remember, this is the number of records in the NIS for which asthma is indicated as a principal diagnosis. Without weighting, it is not an estimate of the number of hospital discharges nationwide for asthma.

                               Std Error                       Std Error
Variable            Mean         of Mean             Sum          of Sum
------------------------------------------------------------------------
Asthma          0.009613        0.000177           67978     1300.693790


NIS National Discharge-Level Estimates

Next, this tutorial will demonstrate how to weight discharge-level data to produce national estimates.

To estimate the number of hospital discharges nationwide with a principal diagnosis of asthma, weight the data by using the WEIGHT keyword in SAS, and the discharge weight data element in the PROC SURVEYMEANS step. For analysis including 2012 and earlier years, replace HOSP_NIS with HOSPID and use the NIS Trend Weight (TRENDWT), available in the resources section of this tutorial, in place of the original discharge weight (DISCWT) for years prior to 2012. See the Multi-Year Analysis Tutorial for more information.


Title1 'Produce National Estimate of Discharges with DXCCS1=128 (Asthma) from 2014 NIS File (Weighted)';
Libname nis2014	"C:\NIS\2014\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 data asthma;
set NIS2014.nis_2014_core 
    (keep=KEY_NIS HOSP_NIS DISCWT NIS_STRATUM DXCCS1);
    Attrib DXCCS1 Length=3 Label='Asthma DXCCS1=128';
    if DXCCS1 eq 128 then Asthma = 1;
    else Asthma = 0;
 run;

 PROC SURVEYMEANS DATA=asthma SUM STD MEAN STDERR ;
    VAR Asthma;
    WEIGHT DISCWT;
    CLUSTER HOSP_NIS ;
    STRATA NIS_STRATUM ;
run;
 
In this example, the result is 339,890 - an estimate of the number of hospital discharges, nationwide, with a principal diagnosis of asthma in 2014.


        
Produce National Estimate of Discharges with DXCCS1=128 (Asthma) from 2014 NIS File (Weighted)
                                                                                           
                                The SURVEYMEANS Procedure                                  
                                                                                           
                                     Data Summary                                        
                                                                                           
			Number of Strata                 206                            
			Number of Clusters              4411                            
			Number of Observations       7071762                            
			Sum of Weights              35358818                            
                                                                                           
                                                                                           
                                    Statistics                                         
                                                                                           
                                Std Error                       Std Error          
Variable            Mean         of Mean             Sum          of Sum          
------------------------------------------------------------------------          
Asthma          0.009613        0.000177          339890     6503.481491          
------------------------------------------------------------------------          

 
One way to verify that you have weighted the data correctly, would be to compare estimates with those generated by a simple query on HCUPnet, the online system which provides quick access to national and regional estimates using HCUP data.

  1. Go to HCUPnet and select "Create a New Analysis."

  2. Click on "Inpatient"

  3. Click on "Descriptive Statistics".

  4. Choose a year - click on "2014".

  5. Under "Do you want data on a specific diagnosis or procedure?" - click on "Yes"

  6. Under "Choose how you want to classify diagnosis or procedure:". Click on "Diagnoses - Clinical Classification Software (CCS)".

  7. Under "Choose your diagnosis" type in "128 Asthma".

  8. Click on "Create Analysis".

  9. Under Data Use Agreement for HCUPnet, read the text and then click "Accept".

The HCUPnet results and your results using SAS® should be the same - in this case 339,890 discharges with a principal diagnosis of asthma (CCS 128) using the original NIS discharge weight (DISCWT). The nationwide statistics in HCUPnet for years prior to 2012 were regenerated using new trend weights to permit longitudinal analysis. Note that since the NIS contains close to a 20 percent sample of all US hospital discharges, another simple check on the accuracy of your weighted estimate is to multiply the number of unweighted discharges by 5.

  Produce National Estimate of Discharges with DXCCS1=128 (Asthma) from 2014 NIS File (Weighted)
                                                                                           
				The SURVEYMEANS Procedure                                  
                                                                                           
					Data Summary                                        
                                                                                           
			Number of Strata                 206                            
			Number of Clusters              4411                            
			Number of Observations       7071762                            
			Sum of Weights              35358818                            
                                                                                           
                                                                                           
					Statistics                                         
                                                                                           
                                Std Error                       Std Error          
Variable            Mean         of Mean             Sum          of Sum          
------------------------------------------------------------------------          
Asthma          0.009613        0.000177          339890     6503.481491          
------------------------------------------------------------------------          


NIS Regional Discharge-Level Estimates

You might want to also produce regional estimates of hospital discharges with a diagnosis of asthma once you have weighted the data. If so, one method for producing these estimates is to use information contained in the HOSP_REGION data element which is available in the hospital file. Then, you can use the DOMAIN SAS keyword to indicate that you want to produce estimates of asthma discharges by region of hospital. The resulting output will contain separate discharge sum estimates for each Census Region (1: Northeast, 2: Midwest, 3: South, and 4: West).

Regional Estimates

Title1 "Produce Regional Estimates of Discharges with DXCCS1=128 (Asthma) from 2014 NIS File (Weighted)";
Libname nis2014 "C:\NIS\2014\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;
     
 Proc Format;
     Value St_Regn 
         1 = '1: Northeast'
         2 = '2: Midwest  '
         3 = '3: South    '
         4 = '4: West     '
     ;
 Run;
 data asthma;
Merge NIS2014.nis_2014_core 
          (keep=KEY_NIS HOSP_NIS DISCWT NIS_STRATUM DXCCS1)
          NIS2014.nis_2014_Hospital (keep=HOSP_NIS HOSP_REGION)
    ;
    By HOSP_NIS;
    Attrib Asthma Length=3 Label='Asthma DXCCS1=128' ;
    if dxccs1 eq 128 then asthma = 1;
    else asthma = 0; 
    format HOSP_REGION st_regn.;
 run;

 PROC SURVEYMEANS DATA=asthma SUM STD MEAN STDERR ;
    VAR asthma;
    WEIGHT DISCWT ;
    CLUSTER HOSP_NIS ;
    STRATA NIS_STRATUM ; 
    DOMAIN HOSP_REGION ;
run;


Produce Regional Estimates of Discharges with DXCCS1=128 (Asthma) from 2014 NIS File (Weighted)     
                                                                                                         
                                  The SURVEYMEANS Procedure                                        
                                                                                                         
                                        Data Summary                                              
                                                                                                         
			Number of Strata                 206                                  
			Number of Clusters              4411                                  
			Number of Observations       7071762                                  
			Sum of Weights              35358818                                  
                                                                                                         
                                                                                                         
                                       Statistics                                                
                                                                                                         
                                                     Std Error                       Std Error      
Variable    Label                        Mean         of Mean             Sum          of Sum      
---------------------------------------------------------------------------------------------      
Asthma      Asthma DXCCS1=128        0.009613        0.000177          339890     6503.481491
---------------------------------------------------------------------------------------------      
Produce Regional Estimates of Discharges with DXCCS1=128 (Asthma) from 2014 NIS File (Weighted)     
                                                                                                         
                               The SURVEYMEANS Procedure                                        
                                                                                                         
                            Domain Statistics in HOSP_REGION                                    
                                                                                                         
  Region                                                                                                 
      of                                                   Std Error                     Std Error
hospital     Variable  Label                      Mean       of Mean           Sum          of Sum
--------------------------------------------------------------------------------------------------
1: Northeast  Asthma   Asthma DXCCS1=128      0.012498      0.000608         82785     4197.759358
2: Midwest    Asthma   Asthma DXCCS1=128      0.009817      0.000363         77975     2895.229590
3: South      Asthma   Asthma DXCCS1=128      0.009010      0.000219        124110     3253.093753
4: West       Asthma   Asthma DXCCS1=128      0.007840      0.000337         55020     2389.375342
--------------------------------------------------------------------------------------------------
 
Check your results using HCUPnet. The first part of the query on HCUPnet will be the same as that which you performed for the national estimate.

  1. Go to HCUPnet and select "Create a New Analysis".

  2. Click on "Inpatient".

  3. Click on "Descriptive Statistics".

  4. Choose a year - click on "2014".

  5. Under "Do you want data on a specific diagnosis or procedure?" - click on "Yes".

  6. Under "Choose how you want to classify diagnosis or procedure:". Click on "Diagnoses - Clinical Classification Software (CCS)".

  7. Under "Choose your diagnosis", type in "128 Asthma".

  8. Click on "Create Analysis".

  9. Under Data Use Agreement for HCUPnet, read the text and then click "Accept".

  10. In terms of patient and hospital characteristics, this time you want to see the discharges by region, so in the left panel under "GET MORE DETAILS, under Hospital/Facility Char." select "Region of the US", then click on "Submit Request".

The HCUPnet results and your results using SAS® should be the same.

Return to Contents


NIS Discharge Weights over Time

NIS data are available annually going back to 1988. The NIS discharge weight data element has changed over time.

For trend analysis spanning 2012 and earlier years, use the NIS Trend Weight (TRENDWT) available at https://www.hcup-us.ahrq.gov/db/nation/nis/trendwghts.jsp, in place of the original discharge weight (DISCWT) included in the NIS core file for years prior to 2012. See the Multi-Year Analysis Tutorial for more information.

Return to Contents

Years Variable Name Use
2001 and later DISCWT All national estimates
2000 DISCWT National estimates except those including total charge
2000 DISCWTcharge National estimates of total charge
1998-1999 DISCWT All national estimates
1998 DISCWT_U All national estimates


NIS Hospital Weights

Prior to 2012, to produce hospital-level estimates you must apply hospital weights to the data.

For NIS years 2012 onwards, hospital weights are discontinued.

Prior to 2012, hospital weights are calculated for each of the NIS strata. Within each of the strata, each hospital's weight is equal to the number of universe hospitals it represents. Because twenty percent of the AHA universe hospitals are sampled in each stratum when possible, the hospital weights are usually near five (as each hospital sampled represents about 5 hospitals).

The NIS 1988-2011 hospital weights are available in the data element hospital weight (HOSPWT) and are stored in each hospital record in the Hospital File. When the hospital weights are applied to the unweighted NIS hospital observations, the result is the number of hospitals for the entire universe - in the case of the 1988-2011 NIS, the universe is all US community hospitals, excluding rehabilitation hospitals beginning with 1998, and excluding Long-Term Acute Care (LTAC) hospitals beginning with 2012.

Return to Contents


Data Elements for NIS Hospital Weights Over Time

Note that the variable for hospital weights depends on the data year. For the 1998-2011 NIS and later years, HOSPWT should be used to create nationwide hospital-level estimates. For NIS databases prior to 1998, the variable HOSPWT_U should be used. Beginning with the 2012 NIS redesign, the hospital weights are no longer needed.

Return to Contents


NIS Unweighted Analysis

You may be interested in using the NIS as an unweighted sample.

In most cases, it is critical to weight the data to produce accurate, unbiased results. However, if your research does not necessitate creating national or regional estimates, do not use any weights in your programming.

An analysis of the association between hospital-level results from a survey on safe hospital practices and hospital-level inpatient risk-adjusted mortality scores is one example of research for which it would be appropriate to use the NIS as an unweighted sample. Visit here Exit Disclaimer to access this study.

Return to Contents


NIS Summary

Let's review what you need to do to create national or regional estimates using NIS data.

Weight your data with discharge weights (DISCWT) for discharge-level analyses. Prior to 2012, weight your data with hospital weights (HOSPWT) for hospital level estimates.

For trend analysis including 2012 and earlier years, use the NIS Trend Weight (TRENDWT) in place of the original discharge weight (DISCWT) for years prior to 2012.

Check the weighted data totals by performing a quick query on HCUPnet.

In most cases, weighting the data is critical to producing accurate and unbiased results. However, if you are not producing national or regional estimates, but rather are using the NIS as an unweighted sample, do not apply weights to your data.

If you are interested in a more detailed statistical explanation of weighting, refer to the documentation available on the HCUP-US website, particularly the report entitled Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later.

Remember that the nationwide databases cannot be used to conduct State-level analyses. If you are interested in performing State-level analyses, use the HCUP State-specific databases.

Return to Contents


Nationwide Emergency Department Sample (NEDS)

The Nationwide Emergency Department Sample (NEDS) is a database of emergency department visits - visits for which the patient was treated and released as well as visits that resulted in a hospital admission - going back annually to 2006.

The NEDS can be used to produce national and regional estimates of emergency department care, utilization, access, costs and quality.

To produce national or regional estimates of the NEDS data contained in the Core File, you must weight the unweighted observations.

Return to Contents


NEDS Discharge Weights

The weights you apply will depend on the type of analysis you are performing. The NEDS data can be weighted to produce emergency department or ED visit level estimates or hospital-level estimates.

To produce discharge-level estimates, such as estimating the number of emergency department visits for influenza in the United States or estimating the number of emergency department visits for hip fractures among the older adults in the United States, you must apply a discharge weight to each record in the Core File.

The discharge weights were calculated for NEDS data by first stratifying the NEDS hospitals on the same variables that were used for creating the sample.

These variables were geographic region, trauma center designation, urban/rural location, teaching status, and ownership.

A weight was then calculated for each stratum by dividing the number of universe of ED visits in that stratum - obtained from AHA data - by the number of NEDS of ED visits in the stratum.

Weighted estimates can be calculated by applying the discharge weights to the sample of ED visits.

Weights have been assigned to each record. In each record in the NEDS Core File, the weight is stored in the data element DISCWT. When the discharge weights are applied to the unweighted NEDS observations, the result is an estimate of the number of of ED visits for the entire universe. In the case of the NEDS, the universe is emergency department visits nationwide.

Return to Contents


NEDS Unweighted Discharge Record Count

Again, let's demonstrate weighting the data using SAS. As with the NIS data, begin by running a simple program to see how many records there are in the NEDS for which influenza is indicated as a first-listed diagnosis.

Unweighted Record Count


Title1 'Count Records with DXCCS1=123 (Influenza) from 2014 NEDS File'; 
libname neds2014 "C:\NEDS\2014\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 data influenza1;
    set NEDS2014.neds_2014_core (keep=HOSP_ED DISCWT NEDS_STRATUM DXCCS1);
    Attrib Influenza length=3 label="Influenza DXCCS1=123";
    if dxccs1 eq 123 then influenza = 1; 
    else influenza = 0;
 run;

 PROC SURVEYMEANS DATA=influenza1 SUM STD MEAN STDERR ;
    VAR influenza; 
    CLUSTER HOSP_ED ; 
    STRATA NEDS_STRATUM ;
run;
  1. Also note, that influenza is CCS code 123, so I am looking for records in which DXCCS1=123.

  2. 
    data influenza1;
        set NEDS2014.neds_2014_core (keep=HOSP_ED DISCWT NEDS_STRATUM DXCCS1);
        Attrib Influenza length=3 label="Influenza DXCCS1=123";
        if dxccs1 eq 123 then influenza = 1; 
        else influenza = 0;
     run;
    
    
  3. Here is the resulting output. Again, the data summary provides a quick summary of the number of strata, clusters, and total observations in the data.
  4. 
    Count Records with DXCCS1=123 (Influenza) from 2014 NEDS File
                                                                                                     
                                        The SURVEYMEANS Procedure                                    
                                                                                                     
                                               Data Summary                                          
                                                                                                     
                                   Number of Strata                  86                              
                                   Number of Clusters               945                              
                                   Number of Observations      31026417                              
                                                                                                     
                                                                                                     
                                               Statistics                                            
                                                                                                     
                                                            Std Error                       Std Error
    Variable     Label                           Mean         of Mean             Sum          of Sum
    -------------------------------------------------------------------------------------------------
    Influenza    Influenza DXCCS1=123        0.005664        0.000132          175741     5960.015903
    -------------------------------------------------------------------------------------------------
    
  5. The summary shown here confirms a database that contains 86 strata, 945 clusters - each cluster representing a single hospital emergency department, and 31,026,417 records, the number of records in the 2014 NEDS.

  6. The statistics section provides the results of the analysis: 175,741 records. This is the number of records in the NEDS for which influenza is indicated as a first-listed diagnosis. This is not an estimate of the number of emergency department visits nationwide for influenza.


NEDS National Discharge-Level Estimates

To estimate the number of nationwide emergency department visits with a first-listed diagnosis of influenza, weight the data using the WEIGHT keyword in the PROC SURVEYMEANS step.

National Estimate


Title1 'Produce National Estimate of Discharges with DXCCS1=123 (Influenza) from 2014 NEDS File (Weighted)';
libname neds2014 "C:\NEDS\2014\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 data influenza1;
    set NEDS2014.neds_2014_core (keep=HOSP_ED DISCWT NEDS_STRATUM DXCCS1); 
    Attrib Influenza length=3 label="Influenza DXCCS1=123";
    if dxccs1 eq 123 then influenza = 1; else influenza = 0;
 run;

 PROC SURVEYMEANS DATA=influenza1 SUM STD MEAN STDERR ;
    VAR  influenza; 
    WEIGHT DISCWT; 
    CLUSTER HOSP_ED ; 
    STRATA NEDS_STRATUM ;
run;




The result is 782,665, an estimate of the number of nationwide emergency department visits, both those in which the patient was treated and released and those that resulted in a hospital admission, with a first-listed diagnosis of influenza in 2014.


Produce National Estimate of Discharges with DXCCS1=123 (Influenza) from 2014 NEDS File (Weighted)
                                                                                                   
                                   The SURVEYMEANS Procedure                                      
                                                                                                   
                                          Data Summary                                            
                                                                                                   
				Number of Strata                  86                                
				Number of Clusters               945                                
				Number of Observations      31026417                                
				Sum of Weights             137807901                                
                                                                                                   
                                                                                                   
                                            Statistics                                              
                                                                                                   
                                                        Std Error                       Std Error  
Variable     Label                           Mean         of Mean             Sum          of Sum  
-------------------------------------------------------------------------------------------------  
Influenza    Influenza DXCCS1=123        0.005679        0.000133          782665           26109  
-------------------------------------------------------------------------------------------------  
To verify that you have weighted the data correctly, perform a query on HCUPnet.

  1. Go to HCUPnet and Click on "Create a New Analysis".

  2. Click on "Emergency Department"

  3. Click on "Descriptive Statistics"

  4. Choose a year – click on 2014. Click on "Yes"

  5. Under "Choose how you want data on a specific diagnoses or procedures" click the drop-down box and select "Diagnosis - Clinical Classification Software (CCS)"

  6. Under"Choose your diagnosis" type in "123 - Influenza".

  7. Click on "Create Analysis".

  8. Under Data Use Agreement for HCUPnet, read the text and then click "Accept".

Analysis will generate and display. The HCUPnet results and the SAS results should be the same.

Return to Contents


NEDS Regional Discharge-Level Estimates

Once you have weighted the data, you might want to also produce regional estimates of emergency department visits for influenza. If so, one method for producing these estimates is to use information contained in the HOSP_REGION data element which are available in the hospital file. Then, use the SAS DOMAIN keyword to indicate that you want to produce estimates of influenza ED visits by region of hospital.

Regional Estimates


Title1 'Produce Regional Estimates of Discharges with DXCCS1=123 (Influenza) from 2014 NEDS File (Weighted)';
libname neds2014 'C:\NEDS\2014\SASDATA';
options obs = MAX PageSize=51 LineSize=146 ;

  Proc Format;
     Value St_Regn 
         1 = '1: Northeast'
         2 = '2: Midwest  '
         3 = '3: South    '
         4 = '4: West     '
     ;
 Run;
 data influenza1;
    Merge NEDS2014.neds_2014_core (keep=HOSP_ED DISCWT NEDS_STRATUM DXCCS1)
          NEDS2014.neds_2014_Hospital (keep=HOSP_ED HOSP_REGION)
    ;
    By HOSP_ED;
    Attrib Influenza length=3 label="Influenza DXCCS1=123";
    if dxccs1 eq 123 then influenza = 1;
    else influenza = 0; 
    format HOSP_REGION st_regn.;
 run;

 PROC SURVEYMEANS DATA=influenza1 SUM STD MEAN STDERR ;
    VAR influenza;
    WEIGHT DISCWT ;
    CLUSTER HOSP_ED ;
    STRATA NEDS_STRATUM ;
    DOMAIN HOSP_REGION ;
run;
     
Check your results by running a quick query on HCUPnet.

  1. The first part of the query on HCUPnet will be the same as the query performed previously for the national estimate of emergency department visits with a first-listed diagnosis of influenza.

  2. Go to HCUPnet home screen, click on "Create a New Analysis".

  3. Click on "Emergency Department".

  4. Click on "Descriptive Statistics".

  5. Select year - 2014. Under "Do you want data on a specific diagnosis or procedure?", click "Yes".

  6. Under "Choose how you want to classify diagnoses or procedures", Select "Diagnoses-Clinical Classification Software (CCS)".

  7. Under "Choose your diagnosis", type in "123 Influenza", and then click "Create Analysis".

  8. Under Data Use Agreement for HCUPnet, read the text and then click "Accept".

  9. The analysis will generate. On the left side bar, and under "GET MORE DETAILS", "Hospital/Facility Char.", select "Region of the U.S.". Then click on "Submit Request".

The HCUPnet results and the SAS results should be the same.

Return to Contents


NEDS Weighted Hospital-Level Estimates

To produce hospital-level estimates, such as the number of U.S. emergency departments with a trauma center designation, you will apply a hospital weight (HOSPWT) to the data.

HOSPWT was also calculated according to the NEDS strata. Within each of the strata, each hospital's weight is equal to the number of universe hospitals it represents during the year.

To demonstrate how to weight at the hospital level, let's consider an analysis that requires us to estimate the number of emergency departments in the United States with a trauma center.

Return to Contents


NEDS Unweighted Hospital ED Record Count

First, tabulate the number of emergency department records in the NEDS from hospitals which are classified as having a trauma center.

Record Count


Title1 'Count Hospital Records with HOSP_TRAUMA>0 from 2014 NEDS HOSPITAL File';
libname neds2014 "C:\NEDS\2014\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 data TRAUMA1;
set NEDS2014.neds_2014_hospital 
      (keep=HOSP_ED DISCWT NEDS_STRATUM HOSP_TRAUMA);
    Attrib Trauma length=3 label='Trauma Hospital';
    if hosp_trauma>0 then trauma = 1;
    else trauma = 0;
 run;

 PROC SURVEYMEANS DATA=TRAUMA1 SUM STD MEAN STDERR ;
    VAR trauma;
    CLUSTER HOSP_ED ;
    STRATA NEDS_STRATUM ;
run;
The result is 203. This is the number of emergency department records in the NEDS from hospitals with a trauma center.

Unweighted Record Count


Count Hospital Records with HOSP_TRAUMA>0 from 2014 NEDS HOSPITAL File
                                                                                           
                                 The SURVEYMEANS Procedure                                 
                                                                                           
                                        Data Summary                                       
                                                                                           
			Number of Strata                  86                           
			Number of Clusters               945                           
			Number of Observations           945                           
                                                                                           
                                                                                           
                                        Statistics                                         
                                                                                           
                                                  Std Error                       Std Error
Variable    Label                      Mean         of Mean             Sum          of Sum
-------------------------------------------------------------------------------------------
Trauma      Trauma Hospital        0.214815               0      203.000000               0
-------------------------------------------------------------------------------------------
    


NEDS National Hospital ED-Level Estimate

To estimate the number of hospitals with a trauma center nationwide you will weight the data.

National Estimate


Title1 'Produce National Estimate of Hospitals with HOSP_TRAUMA>0 from 2014 NEDS HOSPITAL File (Weighted)';
libname neds2014 "C:\NEDS\2014\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 data TRAUMA1;
set NEDS2014.neds_2014_hospital 
      (keep=HOSP_ED HOSPWT NEDS_STRATUM HOSP_TRAUMA);
    Attrib Trauma length=3 label='Trauma Hospital';
    if hosp_trauma>0 then trauma = 1; 
    else trauma = 0;
 run;

 PROC SURVEYMEANS DATA=TRAUMA1 SUM STD MEAN STDERR ;
    VAR trauma; 
    WEIGHT HOSPWT; 
    CLUSTER HOSP_ED ;
    STRATA NEDS_STRATUM ;
run;
    
The result is 958. This is an estimate of the number of hospitals with a trauma center nationwide.

Unweighted Record Count


Produce National Estimate of Hospitals with HOSP_TRAUMA>0 from 2014 NEDS HOSPITAL File (Weighted)
                                                                                                           
                                   The SURVEYMEANS Procedure                                               
                                                                                                           
	                                      Data Summary                                                     
                                                                                                           
			Number of Strata                  86                                         
			Number of Clusters               945                                         
			Number of Observations           945                                         
			Sum of Weights                  4594                                         
                                                                                                         
                                                                                                         
                                        Statistics                                                       
                                                                                                                    
                                                  Std Error                       Std Error              
Variable    Label                      Mean         of Mean             Sum          of Sum              
-------------------------------------------------------------------------------------------              
Trauma      Trauma Hospital        0.208533               0      958.000000    3.044201E-15              
-------------------------------------------------------------------------------------------              
    


NEDS Unweighted Analysis

Depending on the nature of your research, you may be interested in using the NEDS as an unweighted sample.

Examples of an analysis in which it would be appropriate to use the NEDS as an unweighted sample include a hospital-level study in which NEDS emergency department-level data are linked to data on pre-hospital care, such as that provided by emergency medical services.

In most cases, it is critical to weight the data to produce accurate, unbiased results. However, if your research does not necessitate creating national or regional estimates, do not use any weights in your programming.

Return to Contents


NEDS Summary

Let's review what you need to do to create national or regional estimates using NEDS data.

Weight your data with discharge weights for ED visit-level analyses.

Weight your data with hospital weights for hospital-level estimates.

Check the weighted data totals by performing a quick query on HCUPnet.

In most cases, weighting the data is critical to producing accurate and unbiased results. However, if you are not producing national or regional estimates, but rather are using the NEDS as an unweighted sample, do not apply weights to your data.

Remember that the nationwide databases cannot be used to conduct State-level analyses. If you are interested in performing State-level analyses, use the HCUP State-specific databases.

Return to Contents


Kids' Inpatient Database (KID)

A third nationwide database is the KID, a database designed specifically for the study of pediatric conditions that require hospitalization.

The KID is produced every 3 years starting with the 1997 data year.

To produce national or regional estimates using KID data, you must weight the unweighted observations in the KID file.

Return to Contents


KID Discharge Weights

KID data must be weighted to perform discharge-level analyses. Because of the sample design of the KID, it cannot be used as an unweighted database. For more information on the unique sample design of the KID, see the Sample Design tutorial.

Discharge weights were calculated for KID data by stratifying the hospitals on the same variables that were used for creating the sample and then creating weights by stratum.

The stratifying variables were geographic region, urban/rural location, teaching status, bed size, ownership, and children's hospital.

Remember that the KID was designed for the study of pediatric hospitalizations, and that the discharges in the KID are a combination of newborn discharges (including both complicated and uncomplicated) and non-newborn pediatric discharges. Because of this, for each stratum, weights were created for both newborn discharges and non-newborn pediatric discharges.

The weights were created for newborn discharges (both complicated and non-complicated) by dividing the number of universe newborns in the stratum by the number of KID newborns in the stratum.

And the weights were created for non-newborn discharges by dividing the number of universe non-newborn pediatric discharges in the stratum by the number of KID non-newborn discharges in the stratum.

Weighted estimates can be calculated by applying the discharge weights to the sample discharges.

Return to Contents


KID Unweighted Discharge Record Count

Today, let's demonstrate how to weight the KID to produce national and regional estimates of pediatric hospital discharges with a principal diagnosis of cystic fibrosis, a discharge-level estimate. Remember that pediatric discharges are defined as those for which the patient was age 20 or younger at admission.

First, tabulate the number of records in the KID for which cystic fibrosis is indicated as a principal diagnosis.

Unweighted Record Count


Title1 'Count Records with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File'; 
libname kid2012 "C:\KID\2012\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 data cf1;
    set KID2012.kid_2012_core (keep=HOSP_KID DISCWT DXCCS1 KID_STRATUM);
    attrib cysticf length=3 label='Cystic Fibrosis';
    if dxccs1 eq 56 then cysticf = 1;
    else cysticf = 0;
 run;

 PROC SURVEYMEANS DATA=cf1 SUM STD MEAN STDERR ;
    VAR  cysticf;
    CLUSTER HOSP_KID ;
    STRATA KID_STRATUM ;
run;
In other words, records in which DXCCS1 equals 56.

Here is the resulting output. Again, the data summary provides a quick summary of the number of strata, clusters, and total observations in the data.



Count Records with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File             
                                                                                           
                                 The SURVEYMEANS Procedure                                 
                                                                                           
                                        Data Summary                                       
                                                                                           
			Number of Strata                  95                           
			Number of Clusters              4179                           
			Number of Observations       3195782                           
                                                                                           
                                                                                           
                                        Statistics                                         
                                                                                           
                                                  Std Error                       Std Error
Variable    Label                      Mean         of Mean             Sum          of Sum
-------------------------------------------------------------------------------------------
cysticf     Cystic Fibrosis        0.001524        0.000120     4871.000000      430.795123
-------------------------------------------------------------------------------------------


The summary shown here confirms a database that contains 95 strata, 4,179 clusters, each cluster representing a single hospital, and 3,195,782 records, the number of records in the 2012 KID.

The statistics section provides the results of the analysis: 4,871 records. This is the number of records in the KID for which cystic fibrosis is indicated as a principal diagnosis. This is not an estimate of pediatric discharges nationwide for cystic fibrosis.


KID National Discharge-Level Estimates

Weight the data using the WEIGHT keyword in the PROC SURVEYMEANS step.

National Estimate


Title1 'Produce National Estimate of Discharges with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File (Weighted)';
libname kid2012 "C:\KID\2012\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 data cf1;
    set KID2012.kid_2012_core (keep=HOSP_KID DISCWT DXCCS1 KID_STRATUM);
    attrib cysticf length=3 label='Cystic Fibrosis';
    if dxccs1 eq 56 then cysticf = 1;
    else cysticf = 0;
 run;

 PROC SURVEYMEANS DATA=cf1 SUM STD MEAN STDERR ;
    VAR cysticf;
    WEIGHT DISCWT;  
    CLUSTER HOSP_KID ;
    STRATA KID_STRATUM ;
run;

The result is 7,007. This is an estimate of the number of pediatric hospital discharges, nationwide, with a principal diagnosis of cystic fibrosis in 2012.


Produce National Estimate of Discharges with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File (Weighted)
                                                                                                      
					The SURVEYMEANS Procedure                                        
                                                                                                      
						Data Summary                                              
                                                                                                      
				Number of Strata                  95                                  
				Number of Clusters              4179                                  
				Number of Observations       3195782                                  
				Sum of Weights            6675221.93                                  
                                                                                                      
                                                                                                      
						Statistics                                                
                                                                                                      
                                                      Std Error                       Std Error       
    Variable    Label                      Mean         of Mean             Sum          of Sum       
    -------------------------------------------------------------------------------------------       
    cysticf     Cystic Fibrosis        0.001050     0.000086055     7007.542680      619.403091       
    -------------------------------------------------------------------------------------------  
  
Note that HCUPnet contains a path for querying National Statistics on Children. However, this path provides statistics only on discharges in which the patient was age 17 or less. The KID file from the Central Distributor provides data on discharges where patients are age 20 or less. As a result, the weighted estimates produced will not exactly match those provided by HCUP net, unless you limit the data set to those age 17 or less when creating your estimate of discharges. That said, use HCUP net to get a ballpark idea of what the estimate should be.

  1. On HCUPnet, select "Inpatient".

  2. Select "Descriptive Statistics".

  3. Under "Choose a Year" select "2012".

  4. Under "Do you want data on a specific diagnosis or procedure?", click "Yes".

  5. Use CCS codes to identify cystic fibrosis patients, so select "Diagnoses-Clinical Classifications Software".

  6. Then type in "56 for cystic fibrosis" and then click on "Create Analysis".

  7. Under Data Use Agreement for HCUPnet, read the text and then click "Accept".

  8. On the left side bar, under "Focus on Subgroups of Interest" select "Children only (age 0-17)". Click on "Submit Request".

The HCUPnet total will be smaller than the total produced from the KID database because the HCUPnet estimate is limited to pediatric patients ages 0-17, but the KID database includes patients ages 0-20. To match the HCUPnet totals, limit the records from the KID database to ages 0-17.

Produce National Estimate of Discharges with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File (Weighted)
                                                                                                      
					The SURVEYMEANS Procedure                                        
                                                                                                      
						Data Summary                                              
                                                                                                      
				Number of Strata                  95                                  
				Number of Clusters              4179                                  
				Number of Observations       3195782                                  
				Sum of Weights            6675221.93                                  
                                                                                                      
                                                                                                      
						Statistics                                                
                                                                                                      
                                                      Std Error                       Std Error       
    Variable    Label                      Mean         of Mean             Sum          of Sum       
    -------------------------------------------------------------------------------------------       
    cysticf     Cystic Fibrosis        0.001050     0.000086055     7007.542680      619.403091       
    -------------------------------------------------------------------------------------------      


KID Regional Discharge-Level Estimates

Once you have weighted the data, you might want to also produce regional estimates of the number of discharges for cystic fibrosis. For the KID, the data elements KID_STRATUM and HOSP_REGION are on the Core file. Use the SAS DOMAIN keyword to indicate that you want to produce estimates of cystic fibrosis discharges by region.

Regional Estimate


Title1 'Produce Regional Estimates of Discharges with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File (Weighted)';
libname kid2012 "C:\KID\2012\SASDATA";
options obs = MAX PageSize=51 LineSize=146 ;

 Proc Format;
     Value St_Regn 
         1 = '1: Northeast'
         2 = '2: Midwest  '
         3 = '3: South    '
         4 = '4: West     '
     ;
 Run;

 data cf1;
Set KID2012.kid_2012_core 
        (keep=HOSP_KID DISCWT DXCCS1 KID_STRATUM HOSP_REGION);
    attrib cysticf length=3 label='Cystic Fibrosis';
    if dxccs1 eq 56 then cysticf = 1;
    else cysticf = 0; 
    format HOSP_REGION st_regn.;
 run;

 PROC SURVEYMEANS DATA=cf1 SUM STD MEAN STDERR ;
    VAR cysticf;
    WEIGHT DISCWT ;
    CLUSTER HOSP_KID ;
    STRATA KID_STRATUM ; 
    DOMAIN HOSP_REGION ;
run;




Produce Regional Estimates of Discharges with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File (Weighted)
                                                                                                        
					The SURVEYMEANS Procedure                                        
                                                                                                        
                                              Data Summary                                              
                                                                                                        
				Number of Strata                  95                                  
				Number of Clusters              4179                                  
				Number of Observations       3195782                                  
				Sum of Weights            6675221.93                                  
                                                                                                        
                                                                                                        
                                              Statistics                                                
                                                                                                        
                                                  Std Error                       Std Error       
Variable    Label                      Mean         of Mean             Sum          of Sum       
-------------------------------------------------------------------------------------------       
cysticf     Cystic Fibrosis        0.001050     0.000086055     7007.542680      619.403091       
-------------------------------------------------------------------------------------------       
 Produce Regional Estimates of Discharges with DXCCS1=56 (Cystic Fibrosis) from 2012 KID File (Weighted)
                                                                                                        
                                       The SURVEYMEANS Procedure                                        
                                                                                                        
                                    Domain Statistics in HOSP_REGION                                    
                                                                                                        
  Region                                                                                              
      of                                                  Std Error                     Std Error 
hospital     Variable    Label                   Mean       of Mean             Sum        of Sum 
------------------------------------------------------------------------------------------------- 
1: Northeast  cysticf    Cystic Fibrosis      0.000998      0.000238     1126.792347    303.412050 
2: Midwest    cysticf    Cystic Fibrosis      0.001194      0.000203     1732.064175    331.214915 
3: South      cysticf    Cystic Fibrosis      0.000981      0.000154     2515.637383    426.164244 
4: West       cysticf    Cystic Fibrosis      0.001066      0.000199     1633.048776    346.496124 


KID Weights over Time

The KID discharge weight variable has changed over time.
Years Variable Name Use
2003 and later DISCWT All national estimates
2000 DISCWT National estimates except those including total charge
2000 DISCWTcharge National estimates of total charge
1997 DISCWT_U All national estimates



KID Limitations

Unlike the NIS and NEDS data, KID data cannot be used to produce hospital-level estimates because the data for the KID was sampled at the discharge level rather than at the hospital level.

Note that the KID cannot be used as an unweighted sample because of the design of the database.

Return to Contents


KID Summary

Let's review what you need to do to create national or regional estimates using KID data.

Weight your data with discharge weights (DISCWT) for discharge-level estimates.

Check the weighted data totals by performing a quick query on HCUPnet.

Remember, as previously mentioned, HCUPnet contains a path for discharges in which the patient was age 17 or less. However, the KID file from the Central Distributor provides data on discharges where patients are age 20 or less.

Hospital-level analyses are not possible with KID data.

The KID cannot be used as an unweighted sample. Weights should always be applied to the data.

Remember that the nationwide databases cannot be used to conduct State-level analyses. If you are interested in performing State-level analyses, use the HCUP State-specific databases.

Return to Contents


Wrap Up - Key Points

In summary, weighting is a key concept when working with the HCUP nationwide databases.

Please click each box to learn more on what to do and what not to do.

What to do:

Remember that the NIS, NEDS, and KID are sample databases. Thus, to produce national or regional estimates from these databases, you must be sure to properly weight the data.

It is important that you select the proper weight based on the database, the year of data, and the type of analysis you are conducting.

Check your estimates against HCUPnet to ensure that you are using the weights appropriately and calculating estimates and variances accurately.

And keep in mind that proper statistical techniques must also be used to calculate standard errors and confidence intervals when using each of the nationwide databases. For detailed instructions, refer to the special report Calculating National Inpatient Sample (NIS) Variances for Data Years 2012 and Later on the HCUP-US website.

What not to do:

State-level analyses cannot be conducted with the nationwide HCUP databases because the sampling frames are not designed with State as a stratification variable. If you are interested in analyses by State, you should use the State-specific databases.

HCUPnet cannot be used to check unweighted estimates, because weights have been applied to the HCUPnet data.

Remember that the KID cannot be used as an unweighted database.

Return to Contents


Resources and Other Training

If you are looking for more information on the subject matter covered here, several resources are available on the HCUP User Support website (HCUP-US).

If you can't find what you need, feel free to email the HCUP Technical Assistance staff at hcup@ahrq.gov. AHRQ has senior research personnel available to answer technical questions you may have.

Thank you for accessing this module. There are several other HCUP online tutorials. Access these tutorials to learn if there are other topics that could be helpful to you.

If you have any feedback regarding this module, please email us at hcup@ahrq.gov.

Detailed documentation of HCUP resources is available on the HCUP-US website. To access documentation for each of the HCUP national databases, see:

Resources

Additional items of interest include:



Return to Contents



Internet Citation: Producing National HCUP Estimates - Accessible Version. Healthcare Cost and Utilization Project (HCUP). December 2018. Agency for Healthcare Research and Quality, Rockville, MD. www.hcup-us.ahrq.gov/tech_assist/nationalestimates/508_course/508course_2018.jsp.
Are you having problems viewing or printing pages on this website?
If you have comments, suggestions, and/or questions, please contact hcup@ahrq.gov.
Privacy Notice, Viewers & Players
Last modified 12/13/18