The centerpiece of the Negro League data is a set of seven .csv files that summarize game-level data for all 7,061 Negro League games for which Retrosheet has compiled data.
The columns are labeled and should be mostly self-explanatory. Where they are not, the columns are defined here.
These seven files can be downloaded here: Negro League CSV Download.
In addition, traditional event files and box-score event files can be downloaded for Negro League games for which play-by-play and/or box-score data exist. As of January 1, 2025, Retrosheet has box-score data for 3,766 Negro League games and play-by-play data for 1,282 Negro League games. The latter number includes 1,132 games for which play-by-play data have been deduced from newspaper game stories and/or box scores.
Finally, one can download a single file which includes all of the aforementioned files along with biographical data, Negro League team rosters, and logs for individual ballparks, players, and teams. The ballpark logs cover only ballparks which hosted Negro League games, but include all games played at those ballparks. Similarly, player logs are included only for players who played in Negro League games (including some interracial barnstorming games), but the logs include all games played by these players for which Retrosheet has compiled data.
The level of detail at which Negro League data can be determined varies widely across games, and the "known" data are highly uncertain in many cases. For example, for many games we have no box score but may have a reference to the fact that a particular player had at least one hit in the game. To convey this uncertainty, teams and players may be given up to three sets of statistical lines for each game within the data files available for download. These are identified within the .csv files by the variable 'stattype'.
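As a rough illustration of working with the 'stattype' variable, the following Python sketch counts rows per stattype and selects one set of statistical lines. Only the column name 'stattype' comes from the description above. The other column names (gid, team, h), the stattype values (value, lower, upper), and the sample rows are hypothetical placeholders, so check the column definitions before relying on any of them.

```python
import csv
import io
from collections import Counter


def count_rows_by_stattype(csv_file):
    """Count how many rows carry each 'stattype' value."""
    reader = csv.DictReader(csv_file)
    return Counter(row["stattype"] for row in reader)


def rows_for_stattype(csv_file, wanted):
    """Return only the rows whose 'stattype' equals `wanted`."""
    reader = csv.DictReader(csv_file)
    return [row for row in reader if row["stattype"] == wanted]


# Hypothetical sample data: every column except 'stattype', and every
# stattype value shown here, is an assumption made for illustration.
SAMPLE = """gid,team,stattype,h
G001,AAA,value,7
G001,AAA,lower,5
G001,AAA,upper,9
G002,BBB,value,4
"""

counts = count_rows_by_stattype(io.StringIO(SAMPLE))
for stattype, n in sorted(counts.items()):
    print(stattype, n)

# Keep a single statistical line per team-game by choosing one stattype.
chosen = rows_for_stattype(io.StringIO(SAMPLE), "value")
print(len(chosen), "rows selected")
```

With a downloaded file, the io.StringIO(SAMPLE) argument would be replaced by an open file handle, for example open(path, newline=""). Filtering to a single stattype before summing helps avoid counting the same game more than once.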
Recipients of Retrosheet data are free to make any desired use of the information, including (but not limited to) selling it, giving it away, or producing a commercial product based upon the data. Retrosheet has one requirement for any such transfer of data or product development, which is that the following statement must appear prominently:
The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711.
Retrosheet website last updated January 8, 2025.
All data contained at this site is copyright 1996-2025 by Retrosheet. All Rights Reserved.