nci logo
NIH
U.S. National Institutes of Health National Cancer Institute

SEER*Stat Survival Exercise 2

This exercise illustrates period survival statistics, which use only the most recent interval survival estimate of cases diagnosed in different calendar years (cross-sectional estimate of survival). Create a table with 5-year relative survival estimates for regional stage female breast cancer, calculated using the period method for 2005. Use 3-years of data per survival cohort. Use the SEER 17 registries and display annual statistics in the summary pages. Include the standard life (period) and period contributor tables in the output.

For more information about period survival and an overview of cancer survival statistics, see Cohort Definition Using Diagnosis Year on the Statistical Research and Applications Branch Web site.

Key Points and Reminders

  • SEER*Stat allows the calculation of period survival estimates for multiple years. This default and maximum estimate year will be the same as the last year of diagnosis minus one in the data.
  • When producing period survival statistics, you cannot use the year of diagnosis variable in case selections or in the table specifications. SEER*Stat will automatically select cases for the correct years of diagnosis based on the period year estimate, the maximum years of survival, and number of years of survival per cohort specified on the Parameters tab. In this exercise, we are calculating 5-year period survival using 3 year cohorts for 2005. The first interval (one year) will use cases diagnosed in 2003-2005, the second interval (two year) will use cases diagnosed in 2002-2004, and so forth until you have 5 years of survival. This analysis will therefore use 1999-2005 diagnoses.
    • The database used in this exercise contains cases diagnosed from 1973 through 2006, with registries contributing cases from varying years of diagnosis depending on when they joined and began contributing data to the SEER program. Data for four of the registries is not available prior to 2000. California excluding SF/SJM/LA, Kentucky, Louisiana, and New Jersey contribute cases for diagnosis years 2000-2005.
  • Because each period table is comprised of intervals obtained from various contributing life tables, SEER*Stat provides the option of displaying the contributing life tables. Please note that all period parameters are specified in years, but the computations are done monthly (in 12-month increments).

Step 1:  Create a New Survival Session

  • Start SEER*Stat.
  • From the File menu select New > Survival Session or use the Survival button on the toolbar.

Step 2:  Select a Database (Data Tab)

  • It is extremely important that you select the database as the first step. The other choices you will make in this session will be based on variables in the selected database. The correct database must be selected in order to see the correct list of variables in selection statements, table statements, and the dictionary editor.
  • On the Data Tab select "Incidence - SEER 17 Regs Limited-Use + Hurricane Katrina Impacted Louisiana Cases, Nov 2008 Sub (1973-2006 varying)"

Step 3:  Choose the Statistics to Display (Statistic Tab)

  • In the Cancer Survival Measures box, select Relative Survival.
  • Check the Period Survival box. When you enable this selection, a message dialog will appear warning you that the interval parameters have been changed to annual intervals.
  • In the Expected Rate Table drop down box, make sure "U.S. 1970,1980,1990,2000 (White, Black, Other (AI/API) All races for Other Unspec 1991+ and Unknown)" is selected.

Step 4:  Defining the Analysis Cohort (Selection Tab)

  • Specific click-by-click instructions for creating individual selection statements were given in previous tutorials (see Frequency Exercise 1a). Use those techniques to create three selection statements. Be sure to consider the following before making your selections:

    • Selection statements reduce the number of records included in an analysis based on specific variables. If no selection statements are made on the Selection Tab, all records in the database will be included. In this exercise, we want to calculate survival rates for regional stage female breast cancer. Therefore, we need a statement selecting based on stage, sex, and cancer site.
  • The Cases in Limited-Use Database option in the Standard Case Selections box will be checked by default. This will exclude the July through December 2005 cases from Louisiana.
  • Select Edit next to the Case Selection box to open the Case Selection window.
  • Using the controls at the top of the Case Selection window, you will create a search statement. The variables are listed in categories in the Variable box on the top left of the screen. The Selection Statement should read:
    {Race, Sex, Year Dx, Registry, County.Sex} = ' Female'
    And {Site and Morphology.Site rec with Kaposi and mesothelioma} = 'Breast'
    And {Stage.Summary stage 2000 (1998+)} = 'Regional'
  • When finished, click the OK button.

Step 5:  Set the Parameters (Parameters Tab)

  • Skip to the Parameters tab because there are no Table variables for this exercise.
  • Use the default setting for all parameters.
  • Check Standard Life (Period) and Period Contributors (Std Life) boxes in the Display section, and use the default Cumulative Summary settings.

Step 6:  Edit Settings on the Output Tab

  • Enter a title for your results matrix.
  • Use the default settings for the options on the Output tab.

Step 7:  Execute SEER*Stat

  • Use the Execute button or select Execute from the Session menu to execute the session. (Execute Offline is a 3rd option available and has been explained in previous exercises.)
  • You will receive a variable warning dialog instructing you to use caution when using the Summary stage 2000 (1998+) variable. This variable is coded for breast cancer for 1998+, therefore the warning does not apply for this analysis. Click OK to close the dialog and execute the session.
  • A dialog will display the progress of the job. When the job completes a new window will open containing the output table or matrix.

Learn More...

  • The SEER Program strives to make all Localized/Regional/Distant (L/R/D) stage variables consistent for all cancer sites for the appropriate years. However, there are certain site/year combinations where this is not possible. To see which cancer sites were affected by the stage adjustments, click on the "For More Information..." link located on the warning dialog that appears when you execute this session. This link is also available on the selection dialog and within the dictionary editor when working with the Summary stage 2000 (1998+) variable.

Step 8:  The Results Matrix

  • Use the Save As command on the File menu to save the matrix. Enter "Survival Exercise 2" as the filename. SEER*Stat will assign the "ssm" extension to indicate that this is a "SEER*Stat Survival Matrix" file.
  • Compare your results to this SEER*Stat matrix file: Exercise Matrix 2 Results.
  • The results matrix consists of multiple pages of output. Use the drop down list on the toolbar to select a different page to view.
  • The Survival Results Matrix section of the help system contains more information about the SEER*Stat matrix and its features.