An Analysis of Ten Years of the Four Grand Slam Men's Singles Data for Lack of Independence of Set Outcome

Graham Pollard, R Cross, D Meyer

    Research output: Contribution to journal, Article

    4 Citations (Scopus)

    Abstract

    The objective of this paper is to use data from the highest level in men’s tennis to assess whether there is any evidence to reject the hypothesis that the two players in a match have a constant probability of winning each set in the match. The data consists of all 4883 matches of Grand Slam men’s singles over a 10-year period from 1995 to 2004. Each match is categorised by its sequence of win (W) or loss (L) (in set 1, set 2, set 3, ...) to the eventual winner. Thus, there are ten categories of matches, from WWW to LLWWW. The methodology involves fitting several probabilistic models to the frequencies of these ten categories. One four-set category is observed to occur significantly more often than the other two. Correspondingly, a couple of the five-set categories occur more frequently than the others. This pattern is consistent when the data is split into two five-year subsets. The data provides significant statistical evidence that the probability of winning a set within a match varies from set to set. The data supports the conclusion that, at the highest level of men’s singles tennis, the better player (not necessarily the winner) lifts his play in certain situations at least some of the time.
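The ten win/loss categories mentioned in the abstract, and what a constant set-win probability would imply for them, can be illustrated with a short sketch. This is not the authors' code or one of their fitted models. The function names and the parameter `p` (the probability that the better player wins any given set) are assumptions introduced here for illustration only.

```python
from itertools import combinations


def winner_sequences(sets_to_win=3):
    """List every W/L sequence, seen from the eventual winner's side,
    for a match won by the first player to reach `sets_to_win` sets."""
    sequences = []
    for losses in range(sets_to_win):
        # Sets played before the final set, which the winner always wins.
        n_before = sets_to_win - 1 + losses
        for loss_positions in combinations(range(n_before), losses):
            seq = ["W"] * n_before
            for i in loss_positions:
                seq[i] = "L"
            sequences.append("".join(seq) + "W")
    return sequences


def category_probabilities(p, sets_to_win=3):
    """Probability of each category when the better player wins every set
    independently with the same probability p (hypothetical parameter).

    The eventual winner may be either player, so each category sums the
    case where the better player wins and the case where he loses.
    """
    q = 1.0 - p
    probs = {}
    for seq in winner_sequences(sets_to_win):
        k = seq.count("L")
        probs[seq] = p**sets_to_win * q**k + q**sets_to_win * p**k
    return probs


if __name__ == "__main__":
    # p = 0.6 is an arbitrary illustrative value, not an estimate from the paper.
    for seq, prob in category_probabilities(0.6).items():
        print(f"{seq:<6} {prob:.4f}")
```

Under this constant-probability assumption, a category's probability depends only on how many sets the winner lost, so the three four-set categories are equally likely and so are the six five-set categories. An observed excess of one four-set category over the other two is therefore the kind of pattern that counts against the hypothesis.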

    Original language: English
    Pages (from-to): 561-566
    Number of pages: 6
    Journal: Journal of Sports Science and Medicine
    Volume: 5
    Issue number: 4
    Publication status: Published - 2006

    Fingerprint

    Tennis
    Statistical Models

    Cite this

    @article{502075262f814b7b9a03ef01df6f3370,
    title = "An Analysis of Ten Years of the Four Grand Slam Men's Singles Data for Lack of Independence of Set Outcome",
    abstract = "The objective of this paper is to use data from the highest level in men’s tennis to assess whether there is any evidence to reject the hypothesis that the two players in a match have a constant probability of winning each set in the match. The data consists of all 4883 matches of Grand Slam men’s singles over a 10-year period from 1995 to 2004. Each match is categorised by its sequence of win (W) or loss (L) (in set 1, set 2, set 3, ...) to the eventual winner. Thus, there are ten categories of matches, from WWW to LLWWW. The methodology involves fitting several probabilistic models to the frequencies of these ten categories. One four-set category is observed to occur significantly more often than the other two. Correspondingly, a couple of the five-set categories occur more frequently than the others. This pattern is consistent when the data is split into two five-year subsets. The data provides significant statistical evidence that the probability of winning a set within a match varies from set to set. The data supports the conclusion that, at the highest level of men’s singles tennis, the better player (not necessarily the winner) lifts his play in certain situations at least some of the time.",
    author = "Graham Pollard and R Cross and D Meyer",
    year = "2006",
    language = "English",
    volume = "5",
    pages = "561--566",
    journal = "Journal of Sports Science and Medicine",
    issn = "1303-2968",
    publisher = "Department of Sports Medicine, Medical Faculty of Uludag University",
    number = "4",
    }

    An Analysis of Ten Years of the Four Grand Slam Men's Singles Data for Lack of Independence of Set Outcome. / Pollard, Graham; Cross, R; Meyer, D.

    In: Journal of Sports Science and Medicine, Vol. 5, No. 4, 2006, p. 561-566.

    TY - JOUR

    T1 - An Analysis of Ten Years of the Four Grand Slam Men's Singles Data for Lack of Independence of Set Outcome

    AU - Pollard, Graham

    AU - Cross, R

    AU - Meyer, D

    PY - 2006

    Y1 - 2006

    AB - The objective of this paper is to use data from the highest level in men’s tennis to assess whether there is any evidence to reject the hypothesis that the two players in a match have a constant probability of winning each set in the match. The data consists of all 4883 matches of Grand Slam men’s singles over a 10-year period from 1995 to 2004. Each match is categorised by its sequence of win (W) or loss (L) (in set 1, set 2, set 3, ...) to the eventual winner. Thus, there are ten categories of matches, from WWW to LLWWW. The methodology involves fitting several probabilistic models to the frequencies of these ten categories. One four-set category is observed to occur significantly more often than the other two. Correspondingly, a couple of the five-set categories occur more frequently than the others. This pattern is consistent when the data is split into two five-year subsets. The data provides significant statistical evidence that the probability of winning a set within a match varies from set to set. The data supports the conclusion that, at the highest level of men’s singles tennis, the better player (not necessarily the winner) lifts his play in certain situations at least some of the time.

    M3 - Article

    VL - 5

    SP - 561

    EP - 566

    JO - Journal of Sports Science and Medicine

    JF - Journal of Sports Science and Medicine

    SN - 1303-2968

    IS - 4

    ER -