In this exercise, you will reproduce all of the statistics generated in Rate Exercise 4a as well as some additional statistics for All races combined and Hispanic ethnicity. You will do this using a merged variable, rather than complex selection statements.

Create a table showing frequencies and incidence rates (age-adjusted to the 2000 U.S. standard population) for malignant esophageal squamous cell carcinoma. Include only microscopically confirmed cases. Calculate these statistics for persons diagnosed from 1992 through 2021 in the SEER 12 Registries. Do not show statistics based on fewer than 10 cases.
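
SEER*Stat computes these statistics itself, but it can help to see what "age-adjusted to a standard population" and "fewer than 10 cases" mean numerically. The following is a minimal sketch of direct age standardization with a simple Poisson-based standard error. The function name, argument names, and the three-age-group inputs are hypothetical and purely illustrative; SEER*Stat's own standard error and confidence interval methods may differ from this approximation.

```python
from math import sqrt


def age_adjusted_rate(cases, populations, std_weights, per=100_000, min_cases=10):
    """Directly standardized rate and its standard error (illustrative only).

    cases, populations, and std_weights are parallel lists with one entry
    per age group. Returns None when the total case count is below
    min_cases, mirroring the "do not show" rule in this exercise.
    """
    if sum(cases) < min_cases:
        return None  # statistic would be hidden in the table

    total_weight = sum(std_weights)
    rate = 0.0
    variance = 0.0
    for d, n, w in zip(cases, populations, std_weights):
        share = w / total_weight          # standard population proportion
        rate += share * d / n             # weighted age-specific rate
        variance += share ** 2 * d / n ** 2  # Poisson variance of that term
    return rate * per, sqrt(variance) * per


# Hypothetical inputs: three age groups, made-up counts and populations.
result = age_adjusted_rate(
    cases=[2, 10, 8],
    populations=[100_000, 50_000, 20_000],
    std_weights=[0.5, 0.3, 0.2],
)
hidden = age_adjusted_rate([1, 2, 3], [100_000, 50_000, 20_000], [0.5, 0.3, 0.2])
```

With these made-up inputs the rate works out to 15 per 100,000, and the second call returns None because only 6 cases contribute.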

Display the statistics by race, ethnicity, year of diagnosis, and sex. Show data for males and females separately but not combined. Show the following racial and ethnic combinations by creating a merged variable: "All races/ethnicities combined", "Non-Hispanic White", "Non-Hispanic Black", "Non-Hispanic American Indian/Alaska Natives (PRCDA)", "Non-Hispanic Asian or Pacific Islander", and "Total Hispanic". Include standard errors and confidence intervals in the table.

Define squamous cell carcinoma as: Histologic Type ICD-O-3 = 8070-8078,8083-8084
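
The histology definition is a pair of inclusive code ranges. As a rough illustration of how such a definition selects cases (this is not SEER*Stat syntax, and the names SCC_RANGES and is_squamous_cell are hypothetical), a membership check could look like this:

```python
# Inclusive ICD-O-3 histology ranges from the exercise definition.
SCC_RANGES = [(8070, 8078), (8083, 8084)]


def is_squamous_cell(histology_code: int) -> bool:
    """Return True when the code falls inside any of the inclusive ranges."""
    return any(low <= histology_code <= high for low, high in SCC_RANGES)


# Codes between the two ranges (8079-8082) are not part of the definition.
examples = {code: is_squamous_cell(code) for code in (8070, 8078, 8080, 8083, 8140)}
```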

Key Points

  • For this exercise you can use the system-provided merged variable to show statistics for specific racial and ethnic combinations.
  • When producing statistics using SEER Incidence (or mortality) data for American Indians/Alaska Natives, SEER frequently only includes cases that are in a Purchased/Referred Care Delivery Area (PRCDA).
  • The merged variable will allow us to create statistics for additional racial/ethnic groups in the same analysis as the other races. This would be very difficult without using a merged variable. For example, the selection statement in Exercise 4a limits American Indian/Alaska Native cases to PRCDA counties, which would make an All races estimate invalid: the All races estimate should include all American Indians/Alaska Natives regardless of PRCDA.

Step 1:  Open Exercise 4a's Matrix and Extract the Session

  • Open the file saved in Exercise 4a. The filename should be "rate exercise 4a.sim".
  • If you did not save the output for Exercise 4a, you may open our version of Rate Exercise 4a.
  • SEER*Stat matrix files include the session information used to generate the table. This information serves as documentation for the results and provides a convenient method for generating similar statistics.
  • From the Matrix menu select Retrieve Session.
  • Two windows should now be open. Close the matrix window containing the results calculated in exercise 4a. You should now have one window labeled "Rate Session-x" where x is the number of rate session windows that you have created since starting SEER*Stat.

Step 2:  Modify the Selection Statement

  • On the Selection Tab, click the Clear button next to the "Race, Sex, Year Dx (Pop, Case Files)" box. In this exercise, we will use a merged variable to show statistics for specific racial and ethnic combinations.
  • Leave the existing selection statement in the "Other (Case Files)" box.

Step 3:  Create the Merged Variable to use on the Table Tab

For this exercise, you need to define a merged variable with 6 groupings: All races/ethnicities; Non-Hispanic White; Non-Hispanic Black; Non-Hispanic American Indians/Alaska Natives (PRCDA counties only); Non-Hispanic Asian or Pacific Islander; Hispanic (all races).

Open the Data Dictionary.

  1. Use the Merge... button to open the Edit Merged Variable window.
  2. Enter the following Name for the variable: "Race and origin (recommended by SEER)".
  3. Click the Add button to open the New Merged Grouping window. Notice that the New Merged Grouping window is the same as the Selection window used to define your analysis cohort on the Selection Tab. Selection statements are used to define the groupings of a merged variable.
  4. Add the first grouping (All races/ethnicities) by creating a selection statement for the subset of cases the new grouping will contain. Your selection statement should look like this:
    {Race, Sex, Year Dx.Race and origin recode (NHW, NHB, NHAIAN, NHAPI, Hispanic)} = 'Non-Hispanic White','Non-Hispanic Black','Non-Hispanic American Indian/Alaska Native','Non-Hispanic Asian or Pacific Islander','Hispanic (All Races)','Non-Hispanic Unknown Race'
  5. Click OK to return to the Edit Merged Variable window and enter the name for the new grouping.
  6. Click the Add button and repeat these steps to create the 5 additional groupings needed for this exercise:

    Non-Hispanic White
    {Race, Sex, Year Dx.Race and origin recode (NHW, NHB, NHAIAN, NHAPI, Hispanic)} = 'Non-Hispanic White'

    Non-Hispanic Black
    {Race, Sex, Year Dx.Race and origin recode (NHW, NHB, NHAIAN, NHAPI, Hispanic)} = 'Non-Hispanic Black'

    Non-Hispanic American Indians/Alaska Natives (PRCDA counties only)
    {Race, Sex, Year Dx.Race and origin recode (NHW, NHB, NHAIAN, NHAPI, Hispanic)} = 'Non-Hispanic American Indian/Alaska Native'
    AND {Race, Sex, Year Dx.PRCDA 2020} = 'PRCDA'

    Non-Hispanic Asian or Pacific Islander
    {Race, Sex, Year Dx.Race and origin recode (NHW, NHB, NHAIAN, NHAPI, Hispanic)} = 'Non-Hispanic Asian or Pacific Islander'

    Hispanic (All Races)
    {Race, Sex, Year Dx.Race and origin recode (NHW, NHB, NHAIAN, NHAPI, Hispanic)} = 'Hispanic (All Races)'

  7. When you have finished defining the groupings for the merged variable, check Save to Dictionary so the variable is available for use in future sessions.
  8. Click the OK button to close the current window, and then close the Data Dictionary window.
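
Conceptually, each grouping of a merged variable is a selection statement applied to every case, so one case can be counted in more than one grouping. The sketch below models the six groupings as Python predicates to show that behavior. It is not SEER*Stat syntax: the record fields race_origin and prcda and the value "Not PRCDA" are hypothetical stand-ins for the dictionary variables used above.

```python
NH_WHITE = "Non-Hispanic White"
NH_BLACK = "Non-Hispanic Black"
NH_AIAN = "Non-Hispanic American Indian/Alaska Native"
NH_API = "Non-Hispanic Asian or Pacific Islander"
HISPANIC = "Hispanic (All Races)"
NH_UNKNOWN = "Non-Hispanic Unknown Race"

ALL_ORIGINS = {NH_WHITE, NH_BLACK, NH_AIAN, NH_API, HISPANIC, NH_UNKNOWN}

# Each grouping is a predicate over a case record (a dict with hypothetical keys).
GROUPINGS = {
    "All races/ethnicities": lambda c: c["race_origin"] in ALL_ORIGINS,
    "Non-Hispanic White": lambda c: c["race_origin"] == NH_WHITE,
    "Non-Hispanic Black": lambda c: c["race_origin"] == NH_BLACK,
    "Non-Hispanic American Indians/Alaska Natives (PRCDA counties only)":
        lambda c: c["race_origin"] == NH_AIAN and c["prcda"] == "PRCDA",
    "Non-Hispanic Asian or Pacific Islander": lambda c: c["race_origin"] == NH_API,
    "Hispanic (All Races)": lambda c: c["race_origin"] == HISPANIC,
}


def groupings_for(case):
    """Return the names of every grouping whose predicate matches the case."""
    return [name for name, matches in GROUPINGS.items() if matches(case)]


# A non-PRCDA American Indian/Alaska Native case still counts toward the total.
outside = groupings_for({"race_origin": NH_AIAN, "prcda": "Not PRCDA"})
inside = groupings_for({"race_origin": NH_AIAN, "prcda": "PRCDA"})
```

Here outside contains only "All races/ethnicities", while inside contains both the total and the PRCDA grouping. A cohort-level selection statement cannot express this overlap.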

Step 4:  Replace the Row Variable for Race (Table Tab)

  • Select "Race recode (W,B,AI,API) w/o unks" from the Display Variables box, and then click Remove.
  • Use the "+" symbol to expand the Merged category in the Available Variables box at the bottom of the Table Tab.
  • Select "Race and origin (recommended by SEER)" and add it to the row dimension.

Learn More...

  • Notice that the icon next to the "Race and origin (recommended by SEER)" variable under the Merged category in the Available Variables box includes a thumbtack symbol. This indicates that the variable has been saved to the dictionary and will be available in future sessions that use the same database.

Step 5:  Specify a Title (Output Tab)

  • Move to the Output Tab.
  • Enter the following title:
    Malignant Esophageal Squamous Cell Carcinoma
    Microscopically Confirmed Cases Only, 1992-2021
    Rate Exercise 4b
  • Ensure that the option to "Hide Statistics When Fewer Than 10 Cases" is checked.

Step 6:  Create and Save the Matrix

  • Execute the session.
  • A dialog will display the progress of the job. When the job completes, a SEER*Stat matrix window will open containing the output table.
  • Compare your results to this SEER*Stat matrix file: Exercise Matrix 4b Results.