nci logo
NIH
U.S. National Institutes of Health National Cancer Institute

Technical Notes: Joinpoint Regression Analysis of Cancer Trends

An advance in the presentation of cancer trends is the use of joinpoint models (Kim et al., 2000). In some past issues of the Cancer Statistics Review, certain time intervals (e.g., 1973-1996) were specified and the annual percent changes (APC) were computed over those intervals. The choices of where to start and where to end an interval were arbitrary and sometimes did not give an accurate picture of the trend for a given cancer site. For example, the rates might be increasing and decreasing in different parts of the same interval. For some sites, increases occurred in the earlier years, followed by declines in more recent years.

To achieve greater descriptive accuracy, a statistical algorithm finds the optimal number and location of places where a trend changes. The point (in time) where a trend changes is called a joinpoint. Trends may change in different ways at a joinpoint: from up to down, from down to up, from up to up at a different rate, or from down to down at a different rate. A joinpoint regression model describes the trends by a sequence of connected segments where each segment is connected by a straight line on a log scale. Adjacent segments are connected at a joinpoint. The segments are connected because we assume that rates generally change smoothly, rather than "jump" abruptly. The rates are assumed to grow or decay exponentially, i.e., to change by a constant percentage each year. Thus the slope in each segment can be associated with a fixed annual percent change (APC).

Joinpoint analysis first assumes no joinpoints are needed to describe the data accurately, i.e., the trend over the entire interval 1975-2005 does not change. Joinpoints are added in turn if they are statistically significant. Thus, in the final model, each joinpoint represents a significant change in trend. Computational considerations currently limit the maximum possible number of segments to be no larger than four, with three joinpoints. Smoother polynomial models may provide a good fit overall, but are less sensitive to what is occurring at the ends of the data.

In running the Joinpoint program, we set the program parameters as follows: maximum number of joinpoints 3, minimum interval between joinpoints 2 years, minimum interval between a joinpoint and an endpoint 2 years, joinpoints occurring only at exact years. These restrictions provide some added stability to the resultant models. Different values for these parameters may yield a different joinpoint model. Since the test statistic to determine if additional joinpoints are necessary cannot be compared against any known standard distribution to determine significance, (e.g., the normal, t, or f) a permutation test is used which simulates the distribution of the test statistic under the null hypothesis. Thus an element of randomness is introduced by the random number stream used. However, for greater consistency in the p-values obtained if one were to change the random seed for each run, we run the program for 4499 permutations.

Average Annual Percent Change (AAPC) is a summary measure of a trend over a pre-specified fixed interval based on an underlying joinpoint model. It allows us to use a single number to describe the average trend over a period of multiple years. It can be estimated even if the joinpoint model indicates that there were changes in trends during those years, since it is estimated as a geometric weighted average of the joinpoint APCs, with the weights equal to the lengths of each segment over the pre-specified fixed interval. In this report, wehave included AAPCs as an addendum to the underlying joinpoint trends, and as a summary measure to compare fixed interval trends by race/ethnicity. For more information on how the AAPC is calculated and the advantages of reporting an AAPC over APCs, see "Average Annual Percent Change (AAPC)".

A Windows-based program, Joinpoint, is freely available; it accepts data from the SEER*Stat program, as well as user defined data. Further details on joinpoint regression may be found at the web site.