This notebook is for selecting any boardgame and identifying its most comparable games using data from boardgamegeek.com. I use a dimension reduction method (PCA) to learn the main points of variation in data about games from boardgamegeek. To find similar games, I compute the distance between all games using their first twenty principal components. I use the distance between games to determine their overall similarity. Games that are close to each other are said to be neighbors.
This document details a game’s 25 nearest neighbors and illustrate some of the dimensions that make the selected game and its neighbors similar.
First, we can look at summary information for the game as it currently stands on BGG. For the average rating (Average), geek rating (Geek), and average weight (Complexity) of a game, I estimate the game’s values using predictive models trained on historical BGG data.
Outcomes | |||||||||
ID | Game | Published | Player Count | Playing Time | UserRatings | Type | Average | Geek | Complexity |
331363 | Captain's Log | 2022 | 1-4 | 180 min | 0 | Current | 0.00 | 0.00 | 0 |
Estimated | 7.96 | 6.72 | 3 |
Information on the game’s designers, artists, mechanics and categories. Note: designer and artist features are not used in identifying nearest neighbors.
ID | Game | Published | Publisher(s) | Designer(s) | Artist(s) | Categories | Mechanics |
331363 | Captain's Log | 2022 | Selfpublished | Nautical | Action Points | ||
Transportation | Pickup And Deliver | ||||||
Civilization | Modular Board | ||||||
Exploration | Hexagon Grid | ||||||
Adventure | Area Movement | ||||||
Fighting | Dice Rolling | ||||||
Pirates | Grid Movement | ||||||
Age Of Reason | Solo Solitaire Game | ||||||
Actionevent | |||||||
Race | |||||||
Variable Setup | |||||||
Bias | |||||||
Map Addition | |||||||
For the full profile of the selected game, go to https://boardgamegeek.com/boardgame/331363
The table below displays the most similar games to Captain’s Log using data from boardgamegeek (BGG). The reported similarity score is the (squared) inverse of the Manhattan distance between two games.
Note: The analysis does not make use of a game’s number of ratings, average, or geek average, but instead looks only at a game’s categories, mechanics, playing time, player count, and complexity. The current approach for computing the distance weighs each component equally; I plan to explore a change to this in the future by using different measures of distance, as we would reasonably expect some components to be more important than others. I initially relied on Euclidean distance, but in tinkering with the results I have opted to go with the Manhattan distance for now.
Rank | Similarity | Published | ID | Neighbor | BGGRating | GeekRating | Complexity |
1 | 34.40 | 2010 | 25292 | Merchants & Marauders | 7.40 | 7.14 | 3.24 |
2 | 32.44 | 2020 | 282922 | Windward | 7.31 | 5.82 | 2.30 |
3 | 30.80 | 2017 | 201186 | Summit: The Board Game | 7.15 | 5.89 | 2.33 |
4 | 28.91 | 2021 | 259962 | Stress Botics | 8.40 | 5.55 | 4.38 |
5 | 28.47 | 2014 | 82222 | Xia: Legends of a Drift System | 7.88 | 7.39 | 3.18 |
6 | 28.26 | 1988 | 230 | Merchant of Venus | 7.14 | 6.49 | 2.84 |
7 | 27.82 | 2016 | 193558 | The Oracle of Delphi | 7.29 | 6.80 | 2.98 |
8 | 27.40 | 2021 | 275557 | The Last Bottle of Rum | 7.48 | 5.64 | 2.00 |
9 | 27.40 | 2017 | 226320 | My Little Scythe | 7.33 | 6.79 | 1.99 |
10 | 27.34 | 2016 | 169786 | Scythe | 8.22 | 8.06 | 3.43 |
11 | 27.25 | 2012 | 131646 | Merchant of Venus (Second Edition) | 7.16 | 6.62 | 3.00 |
12 | 27.24 | 2018 | 218509 | Empires of the Void II | 7.51 | 6.59 | 3.45 |
13 | 27.14 | 2015 | 141423 | Dead Men Tell No Tales | 7.12 | 6.52 | 2.50 |
14 | 26.98 | 2015 | 181530 | Runebound (Third Edition) | 7.49 | 6.93 | 2.69 |
15 | 26.91 | 2013 | 138161 | Firefly: The Game | 7.39 | 7.04 | 2.95 |
16 | 26.89 | 2019 | 271896 | Star Wars: Outer Rim | 7.70 | 7.20 | 2.50 |
17 | 26.88 | 2021 | 339031 | The Goonies: Never Say Die | 7.54 | 5.70 | 2.50 |
18 | 26.80 | 2016 | 170199 | Solarius Mission | 7.20 | 5.91 | 3.95 |
19 | 26.80 | 2021 | 281676 | Galactic Era | 7.79 | 5.55 | 3.89 |
20 | 26.74 | 1980 | 1152 | The Mystic Wood | 6.48 | 5.70 | 1.48 |
21 | 26.70 | 2014 | 149097 | Spurs: A Tale in the Old West | 6.94 | 5.71 | 2.56 |
22 | 26.64 | 2019 | 207991 | Quodd Heroes | 7.35 | 6.01 | 3.20 |
23 | 26.61 | 2021 | 309430 | Tiny Epic Pirates | 7.19 | 6.06 | 2.77 |
24 | 26.60 | 2012 | 121921 | Robinson Crusoe: Adventures on the Cursed Island | 7.82 | 7.65 | 3.80 |
25 | 26.57 | 1979 | 22 | Magic Realm | 7.21 | 6.35 | 4.53 |
Placing the selected game and its nearest neighbors on the first two principal components.
\[\\[0.05in]\]
Placing games on the first four principal components, which I have found to loosely map to complexity, theme, economy, and cooperation.
\[\\[0.05in]\]
The chart below shows how Captain’s Log compares to its nearest neighbors on each of the ten principal components. Similar games will have similar profiles in terms of their placement on each dimension. This can be a useful way to easily see the dimensions on which games resemble each other. I’ve also plotted games that are at the tails of each component to get a reference point.
We can also use a tile plot to view each of the neighbors across every principal component used, which can be a useful way of looking to see if there’s one dimension in particular on which the games stand out.
What explains a game’s score on each principal component?
To gain a better understanding of these principal components, we can look at the loadings for the variables in the dataset. These are the the contributions of each variable to the ten components used in computing the distance between games. Large loadings (either positive or negative) for a variable indicate that there is a strong relationship between that variable and the component.
For instance, on the first principal component (PC1), time per player, average weight, and playing time have the highest positive loadings, while party game and max players have negative loadings. This indicates that this component seems to map to a game’s complexity - longer, more complex games will have high positive scores on this component while simpler, shorter, party games with lots of players will have low scores.