Finding Related Pages Using Green Measures:
An Illustration with Wikipedia, Companion Website.

Yann Ollivier, Pierre Senellart

This website presents additional data about the content presented in the paper Finding Related Pages Using Green Measures: An Illustration with Wikipedia, published in the proceedings of the AAAI-07 Conference. See also the corresponding presentation slides.

Implementation

The source code of the programs used for extracting the Wikipedia Graph and the implementation of the different methods for discovering related nodes in a graph are freely available:

Global results

Raw evaluation results, as a semicolon-delimited text file, are available here.

Number of votes: 67

Avg. ± conf.Article Std. dev.
Green 7.0 ± 0.3 0.9
SymGreen 6.3 ± 0.3 1.3
Cosine 5.2 ± 0.3 2.2
Cocitations 4.5 ± 0.3 1.9
PageRankOfLinks 2.2 ± 0.2 2.0

Base article: Clique (graph theory)

Number of votes: 41

Green
RankArticle
1. Clique (graph theory)
2. Graph (mathematics)
3. Graph theory
4. Category:Graph theory
5. NP-complete
6. Complement graph
7. Clique problem
8. Complete graph
9. Independent set
10. Maximum common subgraph isomorphism problem
11. Planar graph
12. Glossary of graph theory
13. Mathematics
14. Connectivity (graph theory)
15. Computer science
16. David S. Johnson
17. Independent set problem
18. Computational complexity theory
19. Set
20. Michael Garey
Avg. Mark 7.6 ± 0.3
Std. dev. 1.5
SymGreen
RankArticle
1. Clique (graph theory)
2. Graph (mathematics)
3. Complete graph
4. Graph theory
5. Category:Graph theory
6. Complement graph
7. Independent set
8. Clique problem
9. Glossary of graph theory
10. NP-complete
11. Edge contraction
12. Maximum common subgraph isomorphism problem
13. Connectivity (graph theory)
14. Planar graph
15. L (complexity)
16. Graph coloring
17. Tree (graph theory)
18. Degree (graph theory)
19. Bipartite graph
20. Chordal graph
Avg. Mark 7.6 ± 0.3
Std. dev. 1.4
Cosine
RankArticle
1. Clique (graph theory)
2. Covering (graph theory)
3. Rooted graph
4. Biconnected graph
5. Graph labeling
6. Dipole graph
7. Multiple edges
8. Path cover
9. Isolated vertex
10. Bidirected graph
11. Dominating set problem
12. Adjacent vertex
13. Path (graph theory)
14. Level structure
15. Loop (graph theory)
16. Degree (graph theory)
17. Complement graph
18. Independent set
19. Dense graph
20. Quartic graph
Avg. Mark 6.2 ± 0.4
Std. dev. 2.1
Cocitations
RankArticle
1. Clique (graph theory)
2. Graph (mathematics)
3. Graph theory
4. Category:Graph theory
5. Glossary of graph theory
6. Complement graph
7. Complete graph
8. Degree (graph theory)
9. Graph coloring
10. NP-complete
11. Adjacency matrix
12. Clique problem
13. Complete bipartite graph
14. Cycle (graph theory)
15. Independent set
16. Planar graph
17. Tree (graph theory)
18. Bipartite graph
19. Computational complexity theory
20. Connectivity (graph theory)
Avg. Mark 7.5 ± 0.3
Std. dev. 1.6
PageRankOfLinks
RankArticle
1. Graph theory
2. Graph (mathematics)
3. Category:Graph theory
4. NP-complete
5. Complete graph
6. Clique problem
7. Independent set
8. Clique (graph theory)
9. Complement graph
10. Maximum common subgraph isomorphism problem
Avg. Mark 6.8 ± 0.4
Std. dev. 1.9

Base article: Germany

Number of votes: 62

Green
RankArticle
1. Germany
2. Berlin
3. German language
4. Christian Democratic Union (Germany)
5. Austria
6. Hamburg
7. German reunification
8. Social Democratic Party of Germany
9. German Empire
10. German Democratic Republic
11. Bavaria
12. Stuttgart
13. States of Germany
14. Munich
15. European Union
16. National Socialist German Workers Party
17. World War II
18. Jean Edward Smith
19. Soviet Union
20. Rhine
Avg. Mark 7.0 ± 0.3
Std. dev. 1.6
SymGreen
RankArticle
1. Germany
2. Berlin
3. France
4. Austria
5. German language
6. Bavaria
7. World War II
8. German Democratic Republic
9. European Union
10. Hamburg
11. Christian Democratic Union (Germany)
12. West Germany
13. Denmark
14. Stuttgart
15. Social Democratic Party of Germany
16. German reunification
17. German Empire
18. States of Germany
19. Munich
20. Switzerland
Avg. Mark 5.5 ± 0.3
Std. dev. 1.7
Cosine
RankArticle
1. Germany
2. History of Germany since 1945
3. History of Germany
4. Timeline of German history
5. States of Germany
6. Politics of Germany
7. List of Germany-related topics
8. Hildesheimer Rabbinical Seminary
9. Pleasure Victim
10. German Unity Day
11. Gay rights in Germany
12. Wolfgang Becker
13. Kitty-Yo
14. Metrinomics - Metrivox
15. Germans
16. Basic Law for the Federal Republic of Germany
17. Autobahn
18. West Germany
19. German reunification
20. Veolia Verkehr
Avg. Mark 7.4 ± 0.2
Std. dev. 1.4
Cocitations
RankArticle
1. Germany
2. United States
3. France
4. United Kingdom
5. World War II
6. Italy
7. Netherlands
8. Japan
9. 2005
10. Category:Living people
11. Canada
12. Spain
13. Poland
14. Austria
15. Russia
16. Australia
17. England
18. 2004
19. Switzerland
20. Europe
Avg. Mark 2.1 ± 0.3
Std. dev. 1.8
PageRankOfLinks
RankArticle
1. United States
2. United Kingdom
3. France
4. 2005
5. Germany
6. World War II
7. Canada
8. English language
9. Japan
10. Italy
11. Europe
12. India
13. Russia
14. Latin
15. London
16. China
17. Soviet Union
18. French language
19. Roman Catholic Church
20. Netherlands
Avg. Mark 1.1 ± 0.2
Std. dev. 1.2

Base article: Hungarian language

Number of votes: 52

Green
RankArticle
1. Hungarian language
2. Slovakia
3. Romania
4. Slovenia
5. Hungarian alphabet
6. Hungary
7. Croatia
8. Category:Hungarian language
9. Turkic languages
10. Finno-Ugric languages
11. Austria
12. Serbia
13. Uralic languages
14. Ukraine
15. Hungarian grammar (verbs)
16. German language
17. Hungarian grammar
18. Khanty language
19. Hungarian phonology
20. Finnish language
Avg. Mark 6.2 ± 0.4
Std. dev. 1.9
SymGreen
RankArticle
1. Hungarian language
2. Hungary
3. Romania
4. Slovakia
5. Austria
6. Slovenia
7. German language
8. Palatal consonant
9. Finno-Ugric languages
10. Latin alphabet
11. Turkic languages
12. Uralic languages
13. Serbia
14. Finnish language
15. Declension
16. Hungarian alphabet
17. Croatia
18. Category:Hungarian language
19. Adjective
20. Polish language
Avg. Mark 5.8 ± 0.4
Std. dev. 2.1
Cosine
RankArticle
1. Hungarian language
2. Népújság
3. International Commission for the Protection of the Danube River
4. Category:Hungarian language
5. CEEPUS
6. Pannonian Plain
7. Partnership for Peace
8. Danube
9. Shopping City Süd
10. Alphabetic list of living languages in Europe
11. Hungarian people
12. Eastern Europe
13. POP Air Pollution Protocol
14. Wüstenrot-Gruppe
15. German names for Central European towns
16. Wassenaar Arrangement
17. Erste Bank
18. Jugoslovenske železnice
19. Slovenian language
20. Central European Media Enterprises
Avg. Mark 3.3 ± 0.4
Std. dev. 2.3
Cocitations
RankArticle
1. Hungarian language
2. German language
3. Hungary
4. Romania
5. Romanian language
6. English language
7. French language
8. Italian language
9. Polish language
10. Serbian language
11. Spanish language
12. Russian language
13. Slovak language
14. Latin
15. Croatian language
16. Hungarian people
17. Finnish language
18. Transylvania
19. Czech language
20. Portuguese language
Avg. Mark 3.8 ± 0.4
Std. dev. 2.0
PageRankOfLinks
RankArticle
1. United States
2. United Kingdom
3. France
4. Germany
5. Canada
6. English language
7. Italy
8. Australia
9. Latin
10. Greek language
11. Netherlands
12. World War I
13. German language
14. European Union
15. Switzerland
16. 19th century
17. Israel
18. Austria
19. Brazil
20. Belgium
Avg. Mark 0.5 ± 0.2
Std. dev. 0.9

Base article: Pierre de Fermat

Number of votes: 51

Green
RankArticle
1. Pierre de Fermat
2. Toulouse
3. Fermat's Last Theorem
4. Diophantine equation
5. Fermat's little theorem
6. Fermat number
7. Grandes écoles
8. Blaise Pascal
9. France
10. Pseudoprime
11. Lagrange's four-square theorem
12. Number theory
13. Fermat polygonal number theorem
14. Holographic will
15. Diophantus
16. Euler's theorem
17. Pell's equation
18. Fermat's theorem on sums of two squares
19. Fermat's spiral
20. Fermat's factorization method
Avg. Mark 7.3 ± 0.3
Std. dev. 1.7
SymGreen
RankArticle
1. Pierre de Fermat
2. Mathematics
3. Probability theory
4. Fermat's Last Theorem
5. Number theory
6. Toulouse
7. Diophantine equation
8. Blaise Pascal
9. Fermat's little theorem
10. Calculus
11. Diophantus
12. Statistics
13. Geometry
14. 17th century
15. Fermat number
16. 1601
17. Mathematician
18. 1665
19. Probability
20. Grandes écoles
Avg. Mark 7.0 ± 0.3
Std. dev. 1.7
Cosine
RankArticle
1. Pierre de Fermat
2. ENSICA
3. Fermat's theorem
4. International School of Toulouse
5. École Nationale Supérieure d'Électronique, d'Électrotechnique, d'Informatique, d'Hydraulique, et de Télécommunications
6. Languedoc
7. Hélène Pince
8. Community of Agglomeration of Greater Toulouse
9. Lilhac
10. Institut d'études politiques de Toulouse
11. Bonhoure Radio Tower
12. École Nationale de la Statistique et de l'Administration Économique
13. Cathédrale Saint-Étienne de Toulouse
14. List of Pink Cities
15. Number theory
16. Battle of Toulouse (1814)
17. Wieferich prime
18. Jean-Baptiste Dortignacq
19. Saint-Jean, Haute-Garonne
20. European Physiology Modules
Avg. Mark 2.9 ± 0.4
Std. dev. 2.0
Cocitations
RankArticle
1. Pierre de Fermat
2. Leonhard Euler
3. Mathematics
4. René Descartes
5. Mathematician
6. Gottfried Leibniz
7. Calculus
8. Isaac Newton
9. Blaise Pascal
10. Carl Friedrich Gauss
11. Number theory
12. Euclid
13. Geometry
14. France
15. Joseph Louis Lagrange
16. Diophantus
17. Fermat's Last Theorem
18. Algebra
19. Archimedes
20. Differential equation
Avg. Mark 5.4 ± 0.4
Std. dev. 2.1
PageRankOfLinks
RankArticle
1. France
2. 17th century
3. March 4
4. January 12
5. August 17
6. Calculus
7. Lawyer
8. 1660
9. Number theory
10. René Descartes
11. Probability theory
12. Carl Friedrich Gauss
13. 1665
14. Toulouse
15. 1601
16. Blaise Pascal
17. Analytic geometry
18. Geometric progression
19. Parlement
20. Pierre de Fermat
Avg. Mark 2.5 ± 0.3
Std. dev. 1.9

Base article: Star Wars

Number of votes: 54

Green
RankArticle
1. Star Wars
2. Dates in Star Wars
3. Palpatine
4. Jedi
5. Expanded Universe (Star Wars)
6. Star Wars Episode I: The Phantom Menace
7. Star Wars Episode IV: A New Hope
8. Obi-Wan Kenobi
9. Star Wars Episode III: Revenge of the Sith
10. Coruscant
11. Anakin Skywalker
12. Lando Calrissian
13. Luke Skywalker
14. Star Wars: Clone Wars
15. List of Star Wars books
16. George Lucas
17. Star Wars Episode II: Attack of the Clones
18. Splinter of the Mind's Eye
19. List of Star Wars comic books
20. The Force (Star Wars)
Avg. Mark 7.4 ± 0.3
Std. dev. 1.6
SymGreen
RankArticle
1. Star Wars
2. Jedi
3. Dates in Star Wars
4. Expanded Universe (Star Wars)
5. Star Wars Episode IV: A New Hope
6. Science fiction
7. Palpatine
8. George Lucas
9. Star Wars Episode I: The Phantom Menace
10. Star Wars Episode III: Revenge of the Sith
11. Film producer
12. Star Wars Episode VI: Return of the Jedi
13. Obi-Wan Kenobi
14. Luke Skywalker
15. The Force (Star Wars)
16. Galactic Empire (Star Wars)
17. 2005
18. Star Wars Episode II: Attack of the Clones
19. Coruscant
20. 1980s
Avg. Mark 6.9 ± 0.2
Std. dev. 1.2
Cosine
RankArticle
1. Star Wars
2. Expanded Universe (Star Wars)
3. Star Wars Episode IV: A New Hope
4. Star Wars Episode VI: Return of the Jedi
5. Dates in Star Wars
6. Original trilogy (Star Wars)
7. Star Wars Episode V: The Empire Strikes Back
8. Darth Vader
9. Anakin Skywalker
10. Yoda
11. C-3PO
12. Obi-Wan Kenobi
13. Star Wars Episode I: The Phantom Menace
14. Star Wars Episode III: Revenge of the Sith
15. Themes in Star Wars
16. The Force (Star Wars)
17. Fan criticism of George Lucas
18. List of Star Wars planets (M-N)
19. List of Star Wars Old Republic characters
20. Luke Skywalker
Avg. Mark 7.8 ± 0.2
Std. dev. 1.4
Cocitations
RankArticle
1. Star Wars
2. United States
3. Science fiction
4. Jedi
5. Dates in Star Wars
6. Expanded Universe (Star Wars)
7. Luke Skywalker
8. Darth Vader
9. Palpatine
10. Star Wars Episode IV: A New Hope
11. Galactic Empire (Star Wars)
12. Star Trek
13. George Lucas
14. 2005
15. Obi-Wan Kenobi
16. 2006
17. English language
18. Galactic Republic (Star Wars)
19. Han Solo
20. Computer and video games
Avg. Mark 4.7 ± 0.3
Std. dev. 2.0
PageRankOfLinks
RankArticle
1. 2006
2. 2005
3. Germany
4. England
5. World War II
6. 2003
7. Italy
8. Europe
9. Australia
10. World War I
11. 1985
12. 1984
13. 1977
14. Earth
15. 1978
16. Norway
17. Ancient Rome
18. Vietnam War
19. December 31
20. Nazism
Avg. Mark 0.6 ± 0.2
Std. dev. 1.0

Base article: Theory of relativity

Number of votes: 58

Green
RankArticle
1. Theory of relativity
2. Special relativity
3. General relativity
4. Spacetime
5. Lorentz covariance
6. Albert Einstein
7. Principle of relativity
8. Electromagnetism
9. Lorentz transformation
10. Inertial frame of reference
11. Speed of light
12. Galilean transformation
13. Local symmetry
14. Category:Relativity
15. Galilean invariance
16. Gravitation
17. Global symmetry
18. Tensor
19. Maxwell's equations
20. Introduction to general relativity
Avg. Mark 8.1 ± 0.2
Std. dev. 1.4
SymGreen
RankArticle
1. Theory of relativity
2. Special relativity
3. General relativity
4. Spacetime
5. Albert Einstein
6. Physics
7. Principle of relativity
8. Euclidean space
9. Vector (spatial)
10. Lorentz covariance
11. Electromagnetism
12. Speed of light
13. Gravitation
14. Inertial frame of reference
15. Category:Relativity
16. Electric charge
17. 20th century
18. Classical mechanics
19. Lorentz transformation
20. Differential geometry and topology
Avg. Mark 7.7 ± 0.3
Std. dev. 1.5
Cosine
RankArticle
1. Theory of relativity
2. Principle of relativity
3. Special principle of relativity
4. Faster-than-light
5. Galilean invariance
6. Global Lorentz covariance
7. Postulates of special relativity
8. General coordinate invariance
9. Local Lorentz covariance
10. Autodynamics
11. Frame of reference
12. Hafele-Keating experiment
13. Principle of inertia (physics)
14. Four-velocity
15. World line
16. Inertia
17. Hyperbolic motion (relativity)
18. Length contraction
19. Local reference frame
20. Lorentz covariance
Avg. Mark 6.7 ± 0.3
Std. dev. 1.6
Cocitations
RankArticle
1. Theory of relativity
2. Albert Einstein
3. Physics
4. Quantum mechanics
5. General relativity
6. Spacetime
7. Special relativity
8. Gravitation
9. Speed of light
10. Mathematics
11. United States
12. Isaac Newton
13. Philosophy
14. Science
15. Electron
16. Time
17. Mass
18. Universe
19. Atom
20. Physicist
Avg. Mark 6.1 ± 0.4
Std. dev. 2.2
PageRankOfLinks
RankArticle
1. 1915
2. 1908
3. 1905
4. 1907
5. Gravitation
6. Light
7. Geometry
8. Albert Einstein
9. Speed of light
10. General relativity
11. Special relativity
12. Electromagnetism
13. Vacuum
14. Galileo Galilei
15. Matter
16. Spacetime
17. Maxwell's equations
18. Tensor
19. Theory of relativity
20. Differential geometry and topology
Avg. Mark 2.7 ± 0.3
Std. dev. 2.0

Base article: 1989

Number of votes: 58

Green
RankArticle
1. 1989
2. Cold War
3. 1912
4. Tiananmen Square protests of 1989
5. Soviet Union
6. German Democratic Republic
7. George H. W. Bush
8. 1903
9. Communism
10. 1908
11. 1929
12. Ruhollah Khomeini
13. March 1
14. Czechoslovakia
15. June 4
16. The Satanic Verses (novel)
17. 1902
18. November 7
19. October 9
20. March 14
Avg. Mark 5.4 ± 0.3
Std. dev. 2.0
SymGreen
RankArticle
1. 1989
2. Cold War
3. 1912
4. 1980s
5. 1908
6. 1903
7. Soviet Union
8. George H. W. Bush
9. German Democratic Republic
10. 1901
11. 1929
12. Tiananmen Square protests of 1989
13. 1898
14. 1910
15. 1904
16. 1911
17. 1906
18. Berlin Wall
19. March 1
20. 1921
Avg. Mark 3.8 ± 0.3
Std. dev. 2.0
Cosine
RankArticle
1. 1989
2. List of historical anniversaries
3. Calendars of 2007
4. List of town tramway (urban tramway, streetcar) systems - Europe
5. List of town tramway (urban tramway, streetcar) systems - Africa and Asia
6. 1990
7. List of Stewards of the Chiltern Hundreds
8. List of television ratings during the Monday Night Wars
9. List of English Test cricketers
10. List of town tramway (urban tramway, streetcar) systems - Oceania, North America and South America
11. 1991
12. List of Nobel Prize in Physics winners by longevity
13. List of Stewards of the Manor of Northstead
14. Traditional Catholic Calendar
15. 2004
16. 1979
17. 1983
18. List of Nobel Prize in Chemistry winners by longevity
19. 1986
20. London Lighting-up Times
Avg. Mark 2.1 ± 0.3
Std. dev. 2.0
Cocitations
RankArticle
1. 1989
2. 1990
3. 1991
4. 1993
5. 1992
6. 2005
7. 1988
8. 1994
9. 1995
10. 2004
11. 1987
12. 1997
13. 1996
14. Category:Living people
15. 1999
16. 2003
17. 1998
18. 2001
19. United States
20. 2000
Avg. Mark 1.9 ± 0.4
Std. dev. 2.2
PageRankOfLinks
RankArticle
1. United States
2. United Kingdom
3. France
4. Germany
5. Japan
6. Europe
7. India
8. Australia
9. Russia
10. China
11. Soviet Union
12. New York City
13. Netherlands
14. Egypt
15. Poland
16. Sweden
17. 1945
18. California
19. 1989
20. Ireland
Avg. Mark 1.1 ± 0.3
Std. dev. 1.5