Reach/Frequency Estimation for the

Internet World Wide Web

 

 

 

 

by

 

Jongpil Hong

 

Doctoral Student

 

jphong@mail.utexas.edu

 

Department of Advertising

College of Communication

The University of Texas at Austin

Austin, Texas 78712

 

 

and

 

John D. Leckenby

 

Everett D. Collier Centennial Chair

in Communication

 

john.leckenby@mail.utexas.edu

http://uts.cc.utexas.edu/~tecas/

 

Department of Advertising

College of Communication

The University of Texas at Austin

Austin, Texas 78712

 

 

paper submitted to

 

1997 Annual Conference

American Academy of Advertising

St. Louis, Missouri

 

 

 

 

 

 

Reach/Frequency Estimation for the

Internet World Wide Web

 

 

Abstract

 

 

This paper examines the concepts of reach and frequency in the Internet media environment. Data were collected to provide input to three reach/frequency estimation methods and to provide a benchmark via tabulated schedules of Internet vehicles. Resul ts show that for the three models studied (Beta Binomial Distribution, Morganzstern Sequential Aggregation Distribution and the Conditional Beta Distribution) all perform less accurately in the new medium than they have in magazine and television environm ents. In some cases, however, the results are comparable to those found in other media. This study concludes it may be necessary to develop new models or modify existing models to meet the special requirements of the Internet medium. Since the data ba se is small, this study should be viewed as a pilot for larger-scale work (more subjects in audience measurement data base and more test schedules) before definitive conclusions are drawn.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Reach/Frequency Estimation for the

Internet World Wide Web

 

 

 

 

Introduction

 

Reach and frequency estimation of media schedules for media types such as magazines, spot, network and cable television, spot and network radio, newspapers, and other standard media types have become the common method of operation for media dir ectors and planners in advertising agencies over the last 50 years or so. These estimations are conducted using probability models of proprietary and non-proprietary natures.

All indications point to an increasing trend in the usage of reach/frequency models as seen in studies of the practices of media directors in the top 200 advertising agencies in the U.S. In 1982, 87.9% of these agencies used estimation models for reac h and 74.5% used them for the frequency distribution estimation. The corresponding figures for 1993 were 90.5% for reach and 87.3% for the frequency distribution (Leckenby and Kim 1994). While these increases in 1993 are not dramatic, it is clear the mo de of operation in media planning continues to rely upon reach/frequency estimation and the trend is upward rather than downward in usage.

With the advent of the new Internet World Wide Web (WWW) as a medium for advertisers, the issue arises as to the applicability of the standard operating approach in media planning as applied to this new medium. Are the twin concepts of reach and frequ ency useful tools for the building of WWW media schedules? If so, do the methods developed over the past 40 years in other standard media types work when used to estimate reach/frequency for this new medium? Or will entirely new methods need to be inven ted? This question assumes importance because of the rapid growth of this medium and its projected consequences.

The Internet has been the fastest growing medium in the last few years. User demand on the Internet has experienced dramatic increase since its launch. The approximate number of total Internet users is 32 million (Forrester Research 1996). The Inter net growth rate in domain and host numbers internationally also supports the notion of dramatic increase in demand on this medium. For example, between July 1995 and July 1996, the Americas experienced 138 percentage growth in domain numbers with a total of 8,224,279 hosts. Other countries experienced a more dramatic increases: 686 percent for Malaysia and 521 percent for Indonesia (CyberAtlas, 1996 at URL: http://www.cyberatlas.com/news.html). The explosive increase of the Internet can also be demonst rated by the number of downloads of the two most popular web browsers: Netscape and Internet Explorer. According to Netcape (1996 at URL: http://www.netscape.com/), its web site receives more than 85 million hits per day.

While the first question is outside the scope of this paper, it is clear that history is on the side of the application of the "old tools" to the "new media." When radio and television came along as new media available to advertise rs, many of the techniques in reach/frequency estimation as well as the terminologies of magazines and newspapers were applied to these new media. As noted earlier, 87.9 percent of media planners apply reach/frequency models to schedules consisting of se veral different media types (Leckenby and Kim 1994). It also worked the other way around; gross rating points (GRP) has come to be applied to magazines and other media when invented for the new medium of television. History points toward an interactive relationship between media planning applications in the "old" and the "new" media.

The second question serves as the focus of this paper. Standard media reach/frequency models developed for magazines, television and other media can be applied to the new WWW medium. The main question concerns how accurate these old methods may be in the new media environment. Since no data are publicly available of the type needed for such a study, original data need to be collected on WWW site audiences first and then used as input and benchmarks for accuracy of the old models in the new medium.

This approach will serve as the basis for this examination of Internet WWW reach/frequency planning.

 

 

Models

Three models which have served as the basis for performance comparisons in magazine and television media schedules, among others, will be used as the estimation methods in this study of Internet WWW reach/frequency: (1) Beta Binomial D istribution (BBD); (2) Sequential Aggregation Distribution (SAD); and (3) Conditional Beta Distribution (CBD).

These three models have been studied extensively and are selected to represent the spectrum of methods available for reach/frequency estimation (Chandon 1976; Danaher 1988a,b; Danaher 1989; Headen, Klompmaker and Rust, 1979; Hofmans 1969; Ju 1990; Kishi 1987; Leckenby and Kim 1992; Lee 1988; Rice 1985; Rust 1986; and Rust and Leone 1984). In addition, many of these approaches have been available for use in proprietary formats for several years (Lancaster 1987; and Telmar 1980).

 

Beta Binomial Distribution

This is one of the oldest known models which was developed by Richard Metheringham in the 1960’s (Metheringham 1964) for use in advertising agency media planning. It is the simplest of the models studied here and, therefore, has been frequentl y used in practice (Leckenby and Kim 1994). It is also the least accurate, generally speaking, of any of the known models except the binomial distribution (Leckenby and Ju 1990). But it serves as a benchmark for more complex models and may be appropriat e in certain media situations. If the Internet exhibits low between-vehicle duplication, for example, then this method may be appropriate. The main problem in the BBD lies in its estimation of between-vehicle duplication (cross-pair duplication) which r esults in low overall estimation accuracy.

 

Sequential Aggregation Distribution

This method uses Morgenzstern’s reach formula to estimate reach (Chandon 1976). Then, each vehicle’s marginal probability distribution is developed using the BBD separately for each vehicle. This overcomes the problem of the BBD used alone co ncerning between-vehicle duplication. These marginal distributions are combined sequentially to form a two-dimensional joint exposure distribution which is collapsed at each step along the main diagonals to form a marginal distribution to combine with th e next vehicle’s marginal distribution. It is known that different order of vehicle aggregation produces different results (Lee 1988). This method is often used in practice (Leckenby and Kim 1994) and is very accurate (Leckenby and Ju 1990) if theoretic ally inelegant.

 

Conditional Beta Distribution

This method was developed by Kim (1994) to overcome some of the theoretical problems of the Sequential Aggregation Distribution method. It combines the use of the Canonical Expansion method of Danaher (1988a) and the Beta Binomial Distribution used as conditional distributions. It uses the BBD as the basis for all estimations but in a model which uses the BBD for separate parts of the modeling for between- and within-vehicle duplication. It has proven very accurate in magazine schedule testi ng (Kim 1994). And, unlike the DMDLK (Leckenby and Kishi 1984), this method is applicable to all schedules whether they have equal or unequal insertions in vehicles. Unlike SAD, this method is a true multivariate probability distribution and, therefore, behaves in predictable ways.

 

 

Test Procedure

The history of such studies shows that, most frequently, input data for the models and benchmark data for accuracy assessment of those models are syndicated data such as those developed routinely by Simmons Market Research Bureau, A.C. N ielsen or, prior to their demise, Arbitron, Inc. Unfortunately, the data currently available for the Internet WWW do not provide the type of information needed for a full test of reach/frequency model performance.

Currently, data are available on WWW site audiences relating to total hits and its variations (URLs: http://www.ipro.com; http://www.webtrack.com; http://techweb.cmp.com/techweb/ia/features/history.html). In addition, the concept of "cookie s" has provided a glimpse of individual user tracking on sites (Barr 1996; Liberatore 1996; Mills 1996). What is needed for reach/frequency models of the type to be tested here are data on "unique users" of a site on two subsequent occasio ns. These data would allow the establishment of within- and between-vehicle duplication as well as average site audience. Hong and Leckenby (1996) showed how to estimate unique users as a function of total site hits in a given time frame; however, this estimation process cannot take the place of actual unique user measurement since that process provides no estimate of duplications.

Because of this situation, the authors undertook a small pilot study to collect data on WWW site audiences of the type needed to conduct full-scale tests of the models. Because of the limited audience demographics and sample size, these data are m eant to be illustrative rather than definitive as a means for testing the models in this study.

 

WWW Audience Measurement Procedure

During the Fall of 1996, 92 students in advertising classes at a large southwestern university were asked to complete a group-administered questionnaire which closely follows the ‘Recent Reading’ method of magazine audience measurement as emplo yed by MRI, Inc. (Mediamark Research Inc. 1984).

In this methodology, two questions are asked after the respondents were shown, via computer projection, the "front page" of each of the top 30 WWW sites with respect to current advertising revenues as compiled by Jupiter Communications i n Fall 1996 (URL: http://www.jupiter.com/):

 

Question (1): "Do you think you have ever read or looked into this web site during the last six months?

___ Yes. I am sure I have.

___ I am not sure.

___ No. I am sure I have not.

 

Question (2): "Did you happen to read or look into this web site in the past 7 days?

___ Yes. I am sure I did.

___ I am not sure.

___ No. I am sure I didn’t.

 

These questions were asked of the sites after each was shown on the large-screen computer projection system for 15 seconds each. They were shown in alphabetical order. This constituted the Phase I data collection process.

Seven days later, the process was repeated for Phase II of the data collection. In this phase, the sites were shown in reverse alphabetical order to control for order effects. The measurement took about 20 minutes in each phase of data collection . Respondents provided their ID number as the basis for questionnaire matching on the two occasions.

Calculation of audience statistics was based upon the definition used by SMRB (Simmons Market Research Bureau 1989). The Average Site Audience was calculated as:

 

Average Site Audience = (Site viewers in Week 1 + Site viewers in Week 2) / 2 (1)

 

The Cumulative Site Audience was calculated as:

 

Cumulative Site Audience = (Site viewers in Week 1 or Site viewers in Week 2

or Both Viewers) (2)

 

The Between-Vehicle Duplication was calculated as:

 

Between-Vehicle Duplication = (Site 1 and Site 2 Viewers) (3)

 

These data were divided by 92 (measurement sample size) to provide estimates of the audience proportions which are input to the media reach/frequency estimation models. To qualify as a "viewer," the respondent must have answered & quot;Yes. I am sure I did." to Question #2 above.

The above three media vehicle statistics for each of the 30 sites served as input data for the three model estimations. One site (iWorld) was dropped since its audience was non-existent over the two phases of measurement. Table1 and Table 2 summa rize the results of the average site audience, the cumulative site audience, and the between-vehicle duplication for the 30 sites.

 

Benchmark Tabulations

Because this is a pilot study, 28 schedules of WWW site vehicles were developed to show a reasonable test of the models’ efficacy. Table 3 shows the distribution of these 28 schedules by number of vehicles. The vehicles each contained two ins ertions to be compatible with the tabulation and measurement systems over Phase I and Phase II. There were four schedules each of sizes 2, 3, 4,5,6,7 and 8 vehicles, respectively. This provided exposure distributions in range of size from 4 insertions t otal to 16 insertions total.

Tabulation involves, for each schedule, counting person-by-person exposure to each of the vehicles on each of the two measurement phases. This results in the "true" answer for the sample of exposure to the schedule vehicles over two occa sions. This shows, in a two-vehicle schedule, for example, the proportion of the sample exposed no times, one time, two times, three times or four times to the vehicles in the schedule. Notice that measurement is about the general site and not specific renditions of the site.

The reach and frequency distributions for these schedules were then estimated using the three models. The performance of the models is assessed using standard error criteria described below.

 

Performance Evaluation Criteria

Definition of Error

 

In evaluating performances of different models, their accuracy depends partly on the manner in which error is defined in the study. In this study, two different error factors, error in reach estimation (AER) and error in the exposure distribution (APE ), are adopted from previous studies (Kishi and Leckenby 1982; Leckenby and Kishi 1984). Danaher (1991) also used these definitions of AER and APE (he renamed AER as RER, "relative error in reach," and APE as EPOR, "error in exposure probab ilities over schedule reach").

The error in the reach estimates for the test schedules was defined as the absolute value of the difference between the observed and predicted reach in terms of percentage.

 

Average percentage error in reach

(AER) AER=S (|oi-ei|/oi)/K (4)

where:

oi = observed reach of schedule i

ei = estimated reach of schedule i

K = total number of schedules.

 

The error in the each exposure level is simply defined as the absolute difference between the observed and the estimate frequencies.

 

Average percentage error in exposure distribution:

(APE) APE = (S PEi ) / K (5)

where:

S PEi = (S | oij - eij| ) /S oij

PEi = percentage error in the schedule i

oij = observed frequency at exposure level j of schedule i

eij = estimated frequency at exposure level j of schedule i

S oij = observed reach of schedule i

K = total number of schedules.

 

 

Results

The Average Percentage Errors in Reach (AER)

Tables 4 and 5 provide two example schedules (a small—3-vehicle, and a large—8 vehicle are shown) to give some sense of the typical shapes of the exposure distribution and the magnitude of errors for each schedule. Typically, the distributions are more complex than those found in magazines or television. The primary contributing factor is not the high cumulative audience which contributes to bumps at even-numbered frequencies (this is observed in other media types as well), but rather the low between-vehicle or cross-pair duplication because of the tremendous number of sites available to the viewer.

Table 6 shows the Average Error in Reach results for the sample of 28 schedules. Clearly, of the three models, the Beta Binomial Distribution produces the lowest error, followed by CBD and MSAD. This is contrary to previous findings in other medi a types where the BBD usually produces the poorest results in accuracy. Note the BBD and CBD are not very far apart on AER.

 

The Average Percentage Errors in the Exposure Distribution (APE)

The Average Percentage Errors in the Distribution (APE) are shown for the two sample schedules in Tables 4 and 5.

The overall results for APE for the sample of 28 schedules are shown in Table 6. These errors, again, are relatively large compared to other media types. This can be seen by examining Table 7 for other magazine and television study results on som e of the same models studied here. While in other studies, the CBD shows about a 3 percent AER and 14 percent APE, in this study the AER is about 7 percent and APE shown in Table 6 is about 35 percent, considerably higher.

 

 

Conclusion

This is the first study of which the authors are aware which formally has studied the estimation characteristics of existing reach/frequency models in the Internet medium environment. In this sense, this study is pioneering.

This study has examined the performance of existing reach/frequency estimation models for a sample of 28 Internet WWW media schedules ranging in size from two to eight vehicles and four to sixteen insertions. Results show that the models do not perfor m as well on Internet data as they do for either magazine or television data.

Within the study, it is clear the Conditional Beta Distribution (CBD), overall, outperforms the Beta Binomial Distribution (BBD) and the Morgenzstern Sequential Aggregation Distribution (MSAD). This is an unusual finding since invariably, the MSAD has performed quite well in other media environments.

The data here are based upon a very small sample size of college students. This group is appropriate for study in this medium since it is known to be upscale educationally. But the small number of respondents in the data collection phase of this study and the small number of schedules examined undoubtedly have led to some bias. Other studies have used larger schedules to serve as the basis for error statistics; all indications point to a larger error than observed had that been done in the curre nt study since, usually, larger schedules are harder to estimate and lead to larger error in other media types.

Results of this study, though preliminary, point to the need to develop new or modify existing reach/frequency methods to accurately model the Internet WWW audience.

Table 1

 

Top 30 sites selected in this study and Top 30 sites reported by Interactive Age

 

 

Top 30 sites in this study

(Average Audience in %)

Top 30 sites based on

Number of Unique Users *

1

Netscape (50.5)

Netscape

2

Yahoo (45.1)

ESPNEet SportsZone

3

Lycos (19.6)

Lycos

4

InfoSeek (18.5)

Yahoo

5

Excite (15.8)

Pathfinder

6

Magellan Internet Directory (14.7)

Playboy

7

WebCrawler (12.5)

InfoSeek

8

ESPNET Sports Zone (12.5)

Penthouse

9

CNN Interactive (8.2)

HotWired

10

NBA.com (Starwave) (5.4)

Novell

11

Cool Site of the Day (4.4)

Digital Equipment

12

Playboy (4.3)

Microsoft

13

Discovery Channel Online (3.8)

Global Network Navigator

14

USA Today (3.8)

Electronic Newstand

15

U.S. News Online (3.8)

First Virtual Bank

16

C|net: the computer network (3.3)

Times FAX/New York Times

17

TradeWave Galaxy (3.3)

Internet Shopping Network

18

Netscape World (3.3)

News and Observer Publishing

19

HotWired Magazine (3.2)

Silicon Graphics

20

NewsPage (individual inc.) (2.7)

Beverly Hills Internet

21

Pathfinder (2.7)

NewsPage

22

Wall Street Journal Interactive (2.7)

Netcom Online

23

Cybershop (1.6)

Apple Computer

24

Women's Wire (1.6)

Sony Online

25

GolfWeb (1.1)

eMall

26

Riddler (1.1)

Spyglass

27

Word (1.1)

Spry Inc.

28

NandO.net (0.5)

Mercury Center

29

Zd Net (0.5)

Hollywood Online

30

iWorld (0.0)

1-800 Flowers

 

* as reported by Interactive Age 1995

(URL: http://techweb.cmp.com/techweb/ia/features/hitstory.html/)

Table 2

 

Within-vehicle and Between-vehicle Duplication

 

 

C|net

CNN

Cool

Site

Cyber-

shop

Discovery

ESPNET

Excite

GolfWeb

HotWired

Infoseek

iWorld

Lycos

Magellan

NandO

NBA

Netscape

Netscape

World

NewsPage

Pathfinder

Playboy

Riddler

Galaxy

US News

USA Today

Wall

St J

Web

Crawler

Women's

Wire

Word

Yahoo

ZD Net

C|net

2.2

                                                         

CNN

1.6

5.4

                                                       

Cool Site

0.3

1.1

2.2

                                                     

Cybershop

0.3

0.8

0.5

0

                                                   

Discovery

1.1

1.6

0.5

1.1

2.2

                                                 

ESPNET

2.5

4.1

0.3

0.3

1.1

9.8

                                               

Excite

1.9

2.5

1.4

0.5

1.6

4.9

7.6

                                             

GolfWeb

0.3

0.5

0.3

0.3

0.8

0.3

0.3

0

                                           

HotWired

0.3

1.1

0.8

0.3

0.5

0.8

0.8

0.3

2.2

                                         

InfoSeek

2.2

2.7

2.2

1.4

2.7

2.7

7.3

0.5

2.2

13

                                       

iWorld

0

0

0

0

0

0

0

0

0

0

0

                                     

Lycos

1.9

2.7

0.8

0.5

1.9

5.2

7.6

0.5

1.1

9.8

0

10.9

                                   

Magellan

1.9

2.7

2.2

0.8

1.6

2.5

7.3

0.3

0.8

9

0

6.8

7.6

                                 

NandO.net

0

0.3

0

0.3

0.5

0

0.3

0

0

0.5

0

0.3

0.3

0

                               

NBA.com

0.3

1.1

0

0

0.3

3.5

1.4

0.3

0.5

1.1

0

2.7

0.5

0

3.3

                             

Netscape

3

4.9

3.5

1.1

3

7.3

10.3

0.8

2.7

14.4

0

13.3

10.3

0.3

2.2

40.2

                           

Netscape

World

0.3

0.8

0.3

0.5

1.4

0.3

1.1

0.5

0.3

1.6

0

1.4

0.8

0.3

0.3

2.5

0

                         

NewsPage

0.3

0.8

1.4

0.5

1.1

0.3

1.4

0.3

0.3

2.2

0

0.8

1.9

0.3

0

2.7

0.5

1.1

                       

Pathfinder

0.3

1.6

0.3

0.8

1.6

0.8

0.8

0.3

0.5

2.2

0

1.4

1.1

0.8

0.5

2.2

0.8

0.8

1.1

                     

Playboy

0.5

0

0

0

0.5

1.6

1.6

0.3

0

1.4

0

1.6

0.5

0

0.8

2.2

0.3

0

0

0

                   

Riddler

0.3

0.5

0.3

0.3

0.5

0.8

0.8

0.3

0.3

0.5

0

0.3

0.3

0

0

0.5

0.3

0.3

0.3

0.3

0

                 

Galaxy

0.3

0.8

0.3

0.5

1.4

1.4

1.1

0.5

0.3

2.2

0

2.5

1.1

0.3

0.3

2.7

0.8

0.5

0.8

0.3

0.3

1.1

               

U.S. News

0.5

2.7

0.5

0.8

1.9

1.6

2.5

0.8

0.5

2.2

0

1.6

2.5

0.3

0.3

2.2

1.1

0.8

1.1

0.3

0.5

1.1

2.2

             

USA Today

0.5

1.6

0.5

0.8

1.9

1.1

1.9

0.8

0.5

2.2

0

1.6

1.4

0.3

0.8

2.2

1.1

0.8

1.1

0.3

0.5

1.1

1.9

2.2

           

Wall Street

Journal

0.5

1.4

0.5

0.8

1.6

1.6

0.8

0.5

0.5

1.6

0

0.8

0.8

0.3

0

2.7

0.8

0.8

1.1

0

0.5

0.8

1.4

1.4

2.2

         

Web

Crawler

1.9

2.5

0.5

0.3

1.1

4.4

3.5

0.5

0.8

3.5

0

4.9

3

0

1.4

4.4

0.8

0.3

0.5

0.5

0.3

1.1

1.1

1.1

1.6

6.5

       

Women's

Wire

0.3

0.5

0.5

0.3

0.8

0.3

0.5

0.5

0.3

0.5

0

0.8

0.5

0

0.3

1.1

0.5

0.3

0.3

0.3

0.3

0.5

0.8

0.8

0.5

0.5

0

     

Word

0.3

0.5

0.3

0.3

0.5

0.3

0.3

0.3

0.3

0.5

0

0.3

0.3

0

0

0.5

0.3

0.3

0.3

0

0.3

0.3

0.5

0.5

0.5

0.3

0.3

0

   

Yahoo

3.3

7.9

4.1

1.4

2.7

7.9

11.7

1.1

1.6

13.9

0

15.2

13

0.3

3

26.6

2.5

2.2

2.2

3

0.8

3

3.5

2.5

1.4

8.2

1.6

0.5

35.9

 

Zd Net

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0.3

0

0

0

0

0

0

0

0

0

0

 

Table 3

 

Tabulated Schedule Profiles

(n=28)

 

Schedule

Vehicles

Two-vehicle

(4 inserts)

#1: Netscape, Yahoo

#2: CNN Interactive, Netscape

#3: Cool Site of the Day, Magellan

#4: ESPNET Sports Zone, NBA.com

 

Three-vehicle

(6 inserts)

 

#1: Lycos, Netscape, USA Today

#2: ESPNET Sports Zone, Excite, Yahoo

#3: CNN Interactive, Discovery Channel Online, Magellan

#4: NBA.com, Playboy, WebCrawler

 

Four-vehicle

(8 inserts)

 

#1: CNN Interactive, ESPNET Sports Zone, Netscape, Yahoo

#2: Excite, Infoseek, Lycos, Magellan

#3: C|net, CNN Interactive, Discovery Channel Online, Playboy

#4: Cool Site of the Day, Discovery Channel Online , Netscape, WebCrawler

 

Five-vehicle

(10 inserts)

#1: ESPNET Sports Zone, Infoseek, , Netscape, NBA.com, US News Online

#2: Excite, Infoseek, Lycos, Magellan, Yahoo

#3: CNN Interactive, Cool Site of the Day, HotWired, Playboy, WebCrawler

#4: C|net, HotWired, NBA.com, Pathfinder, Yahoo

 

Six-vehicle

(12 inserts)

#1: CNN Interactive, Cool Site of the Day, Excite, ESPNET Sports Zone, Lycos,

TradeWave Galaxy

#2: Excite, Infoseek, Lycos, Magellan, WebCrawler, Yahoo

#3: C|net, Magellan, Netscape, Netscape World, NewsPage, US News Online

#4: Cool Site of the Day, Discovery Channel Online, InfoSeek, Playboy,

WebCrawler, USA Today

 

Seven-vehicle

(14 inserts)

#1: C|net, Discovery Channel Online, ESPN Sports Zone, HotWired, Lycos,

NBA.com, Yahoo

#2: CNN Interactive, Cool Site of the Day, Infoseek, Magellan, Netscape,

Pathfinder, US News Online,

#3: CyberShop, ESPNET Sports Zone, InfoSeek, Playboy, TradeWave Galaxy,

WebCrawler, Wall Street Journal Interactive

#4: C|net, CNN Interactive, Excite, Magellan, Pathfinder, WebCrawler, Women's

Wire

 

Eight-vehicle

(16 inserts)

#1: Excite, InfoSeek, Lycos, Magellan, Netscape, TradeWave Galaxy,

WebCrawler, Yahoo

#2: C|net, CNN Interactive, HotWired, Lycos, NewsPage, Pathfinder, USA Today,

US News Online

#3: C|net, CNN Interactive, CyberShop, Discovery Channel Online, Excite,

Pathfinder, Playboy, Women's Wire,

#4: Cool Site of the Day, ESPN Sports Zone, GolfWeb, Magellan, NBA.com,

Playboy, USA Today, Yahoo

 

 

 

 

Table 4

Sample Exposure Distribution for a Three-vehicle Schedule

 

(Two inserts in Excite, ESPNET Sports Zone, and Yahoo)

 

 

Observed

 

SAD

BBD

CBD

# of Exposure

%

%

%

%

0

39.13

39.39

35.64

36.14

1

11.96

12.05

24.06

17.11

2

31.52

24.76

16.81

25.79

3

4.35

13.07

11.33

10.22

4

8.70

8.22

7.04

7.53

5

2.17

1.92

3.74

2.01

6

2.17

0.59

1.37

0.92

Errors

       

AER

 

.48

5.68

4.42

APE

 

29.92

67.77

36.24

 

 

 

Table 5

Sample Exposure Distribution for an Eight-vehicle Schedule

 

(Two inserts in Cool Site of the Day, ESPNET Sports Zone, GolfWeb, Magellan,

NBA.com, Playboy, USA Today, and Yahoo)

 

 

Observed

 

SAD

BBD

CBD

# of Exposure

%

%

%

%

0

39.13

34.02

31.69

33.77

1

11.96

12.06

23.40

16.07

2

17.39

20.34

16.30

20.22

3

10.87

15.25

10.96

11.67

4

10.87

10.97

7.13

9.34

5

2.17

4.74

4.49

4.95

6

5.44

1.94

2.72

2.70

7

1.09

0.51

1.58

0.91

8

1.09

0.13

0.88

0.29

9

0

0.02

0.46

0.06

10

0

0

0.23

0.01

11

0

0

0.10

0

12

0

0

0.04

0

13

0

0

0.01

0

14

0

0

0

0

15

0

0

0

0

16

0

0

0

0

Errors

       

AER

 

8.34

12.16

8.75

APE

 

32.97

49.67

34.60

 

 

Table 6

 

Summary of Average Error Calculations

 

 

Internet schedules (n=28)

 

 

Error Type

Model

AER

APE

SAD

9.11

43.34

BBD

5.72

46.82

CBD

6.78

34.97

 

 

 

Table 7

 

 

New Zealand 1985 AGB Magazine Data (n=600)

 

 

Error Type

Model

AER (%)

APE (%)

DMDLK

2.23

15.80

LOGLIN

1.20

12.83

CANEX

2.19

14.82

BBD

11.85

50.40

 

 

SMRB US 1979 Magazine Data (n=515)

 

 

Error Type

Model

AER (%)

APE (%)

DMDLK

2.77

17.68

CANEX

3.08

19.91

BBD

6.46

33.23

CBD

3.12

13.76

MSAD

3.13

15.27

 

 

SMRB US 1984 Magazine Data (n=508)

 

 

Error Type

Model

AER (%)

APE (%)

DMDLK

1.73

23.30

CANEX

3.26

25.83

BBD

4.92

33.59

CBD

3.26

17.85

MSAD

5.84

20.58

 

 

Table 7 (continued)

 

 

TV SMRB Data 1984

 

 

Error Type

Model

AER (%)

APE (%)

ALBET

2.7

12.5

BBD-LD

3.9

5.3

DMDLK

3.5

12.6

BBD-IE

3.1

13.7

 

 

 

References

 

Barr, Christopher (1996), "The Truth about Cookies," [URL: http://www.cnet.com/Content/Voices/Barr/042996/].

 

Chandon, Jean-Louis (1976), A Comparative Study of Media Exposure Models. Unpublished doctoral dissertation, Northwestern University.

 

Danaher, Peter J. (1988a), "Parameter Estimation for the Dirichlet-Multinomial Distribution Using Supplementary Beta-Binomial Data," Communications in Statistics, A17, 6 (June), 777-88.

 

____________(1988b), "A Loglinear Model for Predicting Magazine Audiences," Journal of Marketing Research, 25 (November), 356-62.

 

____________(1989), "An Approximate Loglinear Model for Predicting Magazine)Audiences," Journal of Marketing Research, 26 (November), 473-9.

 

___________(1991), "A Canonical Expansion Model for Multivariate Media Exposure Distributions: A Generalization of the 'Duplication of Viewing Law'," Journal of Marketing Research, 28 (October), 361-7.

 

Forrester Research (1996). [URL: http://www.forrester.com].

 

Headen, Robert S., Jay E. Klompmaker, and Roland Rust (1979), "The Duplication of Viewing Law and Television Media Schedule Evaluation," Journal of Marketing Research, 16 (August), 333-40.

 

Hofmans, Pierre (1969), "Measuring the Cumulative Net Coverage of Any Combination of Media," Journal of Marketing Research, 3 (August), 269-78.

 

Hong, Jongpil and John D. Leckenby (1996), "Audience Measurement and Media Reach/Frequency Issues in Internet Advertising," in Proceedings of the American Academy of Advertising, 15-27.

 

Ju, Kuen-Hee (1990), Simple Approaches to Modeling Advertising Media Exposures. Unpublished doctoral dissertation, The University of Texas at Austin, Austin, Texas.

 

Kim, Heejin (1994), A Conditional Beta Distribution Model for Advertising Reach/Frequency Estimation. Unpublished doctoral dissertation, The University of Texas at Austin, Austin, Texas.

 

Kim, Heejin and John D. Leckenby (1993), "A Test of Canonical Expansion Reach/Frequency Model," in Proceedings of the American Academy of Advertising, 51-59.

 

Kishi, Shizue (1987), "Exposure Distribution Models in Print, Spot-TV, and Mixed-Media Schedules: Empirical Test on Japanese Data," The Annual Conference of the European Marketing Academy, Ontario, Canada.

 

______________and John D. Leckenby (1982), "A Test of the Direct/Indirect BBD and Other Exposure Distribution Models," in Proceedings of the American Academy of Advertising, 46-52.

 

Lancaster, Kent M. (1987), "Optimizing Advertising Media Plans Using ADOPT on the Microcomputer," Paper presented at the Fourth AMA Microcomputers in Marketing Workshop, University of Hawaii at Manoa Honolulu.

 

Leckenby, John D. and Heejin Kim (1994), "How Media Directors View Reach/Frequency Estimation: Now and a Decade Ago," Journal of Advertising Research, 34 (September/October), 9-21.

 

_______________and Heejin Kim (1992), "Unresolved Issues in Media Reach/Frequency Models," in Proceedings of the American Academy of Advertising, 100-106.

 

_______________and Kuen-Hee Ju (1990), "Advances in Media Decision Models," Current Issues and Research in Advertising, 311-357.

 

_______________and Shizue Kishi (1984), "The Dirichlet Multinomial Distribution as a Magazine Exposure Model," Journal of Marketing Research, 21 (February), 100-106.

 

Lee, Hae Kap (1988), Sequential Aggregation Advertising Media Models. Unpublished doctoral dissertation, University of Texas at Austin, Austin Texas.

 

Liberatore, Karen (1996), "Personal Data for Sale," [URL: http://www.macworld.com/netsmart/].

Metheringham, Richard A. (1964), "Measuring the Net Cumulative Coverage of a Print Campaign," Journal of Advertising Research, 4 (December), 23-28.

 

Mills, Elinor (1996), "Tech Update Navigator `Cookies' Files Will Monitor Web User Activity," [URL: http://www.infoworld.com/archives/html/dt_IWE09-96_59.htm].

 

Rice, Marshall D. (1985), Television Exposure Distribution Models in Advertising Media, Unpublished doctoral dissertation, University of Illinois at Urbana-Champaign, Urbana, IL.

 

Rust, Roland T. (1986), Advertising Media Models: A Practical Guide. Lexington, MA: Lexington Books.

 

______________and Robert P. Leone (1984), "The Mixed Media Dirichlet-Multinomial Distribution: A Model for Evaluating Television-Magazine Advertising Schedules," Journal of Marketing Research, 21 (February), 89-99.

 

Simmons Market Research Bureau (1989). Simmons Technical Guide.

 

______________(1980), "Selected Systems for Media Planning and Buying," New York: Telmar Media System Inc.