MillionBase 2017


Rebel
Posts: 6997
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: MillionBase 2017

Post by Rebel »

Michael Sherwin wrote:
Rebel wrote:MillionBase 2017 - 2.9 million quality chess games in PGN format. The 2 yearly TWIC update, added 350.000 new games from the years 2016 and 2017. Database up to date till January 2018.

http://rebel13.nl/download/pgn.html
Hi Ed, Thank you for this! I have a question though. By what method or standard is the quality judged? If I could have my wish it would only be 1 million games or even much less if the quality of every single game was of the highest quality. There can be bad mistakes by one side as long as the other side took advantage of the mistakes, etc.
It's a database of human-human games only. The first 1.5 million games came from the Chess Assistant guys, so the quality is comparable to MegaBase from ChessBase. After that it was updated mainly with TWIC games.

There are several utilities available (such as my own ProTools) to filter on Elo and create a PGN database of games between (say) 2700+ Elo rated GMs only.
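
For readers who just want to try the idea, here is a minimal sketch of such an Elo filter in plain Python. It is not ProTools and does not reflect how ProTools works. The function names are made up for illustration, and it assumes every game starts with an [Event ...] tag line and carries numeric WhiteElo/BlackElo tags (games without them are dropped).

```python
import re

# One PGN tag pair per line, e.g. [WhiteElo "2563"]
TAG_RE = re.compile(r'^\[(\w+)\s+"(.*)"\]\s*$')

def split_games(pgn_text):
    """Yield games one by one; a new game is assumed to start at an [Event ...] line."""
    current = []
    for line in pgn_text.splitlines():
        if line.startswith("[Event "):
            if any(l.strip() for l in current):
                yield "\n".join(current).strip() + "\n"
            current = []
        current.append(line)
    if any(l.strip() for l in current):
        yield "\n".join(current).strip() + "\n"

def tags_of(game):
    """Collect the tag pairs of a single game into a dict."""
    tags = {}
    for line in game.splitlines():
        match = TAG_RE.match(line)
        if match:
            tags[match.group(1)] = match.group(2)
    return tags

def both_rated_at_least(game, min_elo):
    """True only if both Elo tags exist, are numeric, and reach min_elo."""
    tags = tags_of(game)
    try:
        return (int(tags["WhiteElo"]) >= min_elo
                and int(tags["BlackElo"]) >= min_elo)
    except (KeyError, ValueError):
        return False

def filter_by_elo(pgn_text, min_elo=2700):
    """Return the list of games in which both players are rated min_elo or higher."""
    return [g for g in split_games(pgn_text) if both_rated_at_least(g, min_elo)]
```

For a file the size of mb-2017.pgn you would stream the file line by line instead of holding the whole text in memory, but the selection logic stays the same.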

From experience I know the quality of comp-comp games between highly rated chess programs (say 3000+ Elo) is better than that of human games. Have a look at - http://rebel13.nl/misc/protools16a.html
Norbert Raimund Leisner
Posts: 1643
Joined: Tue May 20, 2008 4:57 pm
Location: Augsburg - Germany

Re: MillionBase 2017

Post by Norbert Raimund Leisner »

On PGN, cf. especially the contribution by Steven J. Edwards here:
http://www.saremba.de/chessgml/standard ... mplete.htm

Online Databases
http://computer-chess.org/doku.php?id=c ... tions_list

PGN tools
http://computer-chess.org/doku.php?id=c ... riter_list

Best wishes,
Norbert
gbtami
Posts: 389
Joined: Wed Sep 26, 2012 1:29 pm
Location: Hungary

Re: MillionBase 2017

Post by gbtami »

Rebel wrote:It's a database of human-human games only.
Minor nitpicking. It contains some chess puzzles (compositions) from the 2008.01.01 Corus event, two human-engine games from 2000 (Jussupov vs FRITZ on Primergy), and some Chess960 games from several other events. Btw, in these Shuffle/Chess960 games there is no [Variant "Chess960"] tag at all, and the starting FEN always describes castling availability with a single "-". I think this is not correct. A solution could be to remove all variant games from mb-2017.pgn and collect them in a separate PGN with corrected castling. What do you think?
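
A rough sketch of how such games could be spotted and their castling field repaired, in plain Python. The function names are hypothetical. It assumes the variant games carry a FEN tag with a mirrored, shuffled back rank, and that writing "KQkq" is acceptable when the king starts between the two rooks (as in proper Chess960). It does not check that the bishops stand on opposite colours.

```python
STANDARD_BACK_RANK = "rnbqkbnr"
START_MIDDLE = ["pppppppp", "8", "8", "8", "8", "PPPPPPPP"]

def looks_like_shuffle_start(fen):
    """True if the FEN is an initial position with a non-standard back rank."""
    fields = fen.split()
    if not fields:
        return False
    ranks = fields[0].split("/")
    if len(ranks) != 8:
        return False
    black_back, white_back = ranks[0], ranks[7]
    return (ranks[1:7] == START_MIDDLE
            and sorted(black_back) == sorted(STANDARD_BACK_RANK)  # same pieces, other order
            and white_back == black_back.upper()                  # White mirrors Black
            and black_back != STANDARD_BACK_RANK)

def king_between_rooks(back_rank):
    """Chess960 requires the king to start somewhere between the two rooks."""
    row = back_rank.lower()
    return row.find("r") < row.find("k") < row.rfind("r")

def with_start_castling(fen):
    """Replace a '-' castling field by 'KQkq' for a Chess960-style start position."""
    fields = fen.split()
    if (len(fields) >= 3 and fields[2] == "-"
            and looks_like_shuffle_start(fen)
            and king_between_rooks(fields[0].split("/")[0])):
        fields[2] = "KQkq"
    return " ".join(fields)
```

Games for which looks_like_shuffle_start() is true could then be written to a separate PGN, with a [Variant "Chess960"] tag added and the FEN passed through with_start_castling().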
Vinvin
Posts: 5228
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: MillionBase 2017

Post by Vinvin »

Rebel wrote:MillionBase 2017 - 2.9 million quality chess games in PGN format. The 2 yearly TWIC update, added 350.000 new games from the years 2016 and 2017. Database up to date till January 2018.

http://rebel13.nl/download/pgn.html
Some errors from SCID while parsing:
Importing from mb-2017.pgn...
Imported 2919420 games.
PGN errors/warnings:
(game 2457898, line 40782294) Warning: illegal castling (O-O-O) in game Kulaots, K - Janocha, W, 2014
(game 2458814, line 40798615) Warning: illegal castling (O-O-O) in game Pujari, R - Nguyen Van Huy, 2014
(game 2594641, line 43195006) Warning: illegal castling (O-O) in game Cavendish, Joshua Z - Jones, G, 2015
(game 2661436, line 44379414) Warning: illegal castling (O-O) in game Costachi, M - Vasiliu, Cosmin, 2016
Rebel
Posts: 6997
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: MillionBase 2017

Post by Rebel »

I was aware of these 4 and did not know what to do with them. Cases are when a rook moves, then x moves later the rook returns to its start square and then x moves later castling takes place. So illegal. But what if these games are really played that way? Deleting them would result in an incomplete tournament. Typical -/- situation.
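
To illustrate why a parser rejects such a game, here is a small sketch of castling-right bookkeeping for standard chess. It is not the code of SCID or any other tool mentioned in this thread. The function name and the coordinate move format ("a1b1" = from-square, to-square) are assumptions made for the example. A right is lost for good as soon as anything leaves or lands on the king's or rook's home square, so a rook that returns home does not get it back.

```python
# Home squares and the castling rights that depend on them (standard chess only).
RIGHTS_LOST = {
    "e1": "KQ", "h1": "K", "a1": "Q",
    "e8": "kq", "h8": "k", "a8": "q",
}

def castling_rights_after(moves, rights="KQkq"):
    """Return the castling rights left after a sequence of coordinate moves.

    moves: iterable of strings such as 'a1b1' (from-square followed by to-square).
    A move from a home square means the king or rook moved; a move to a home
    square means the piece there was captured (or a piece came back to it).
    Either way the right stays lost.
    """
    remaining = set(rights)
    for move in moves:
        src, dst = move[:2], move[2:4]
        for square in (src, dst):
            remaining -= set(RIGHTS_LOST.get(square, ""))
    return "".join(c for c in "KQkq" if c in remaining)

# The rook leaves a1 and comes back later: queenside castling is gone for White.
print(castling_rights_after(["a1b1", "b1a1"]))
```

With this bookkeeping a later O-O-O by White is flagged as illegal even though the rook stands on a1 again.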
Vinvin
Posts: 5228
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: MillionBase 2017

Post by Vinvin »

Rebel wrote:
I was aware of these 4 and did not know what to do with them. Cases are when a rook moves, then x moves later the rook returns to its start square and then x moves later castling takes place. So illegal. But what if these games are really played that way? Deleting them would result in an incomplete tournament. Typical -/- situation.
Funny :-)
That reminds me of this game: http://www.chessgames.com/perl/chessgame?gid=1039605
Dreev played 30...O-O (an illegal move) but Kasparov did not object so no forfeit was enforced. Dreev won some moves later.
Dann Corbit
Posts: 12542
Joined: Wed Mar 08, 2006 8:57 pm
Location: Redmond, WA USA

Re: MillionBase 2017

Post by Dann Corbit »

Rebel wrote:
I was aware of these 4 and did not know what to do with them. Cases are when a rook moves, then x moves later the rook returns to its start square and then x moves later castling takes place. So illegal. But what if these games are really played that way? Deleting them would result in an incomplete tournament. Typical -/- situation.
Also some closing quote problems:
F:\pgn\millionbase>eco -C -s -ef:\tmp\eco.pgn mb-2017.pgn -w72 -o mb2017.pgn
Missing closing quote in [Event "Munich 30"]
Missing closing quote in [Site "Munich 30"]
Missing closing quote in [Event "Tubingen 2"]
Missing closing quote in [Site "Tubingen 2"]
Missing closing quote in [White "Van Eijk, Maurice"]
Queenside castling is forbidden to White.
File mb-2017.pgn: Line number: 40828294
Failed to make move 18. O-O-O in the game:
r....rk.
pp.n.pnp
..p..bp.
..Pp....
.P.P....
.N.....P
.P...PPB
R...KB.R

Kulaots, K - Janocha, W European Rapid 2014 Wroclaw POL 2014.12.21
Queenside castling is forbidden to White.
File mb-2017.pgn: Line number: 40844616
Failed to make move 26. O-O-O in the game:
r....bk.
.q.b.pp.
.....n.p
P.p.....
.pB.P...
.N...P.P
..QB..r.
R...K..R

Pujari, R - Nguyen Van Huy 7th Chennai Open 2014-15 Chennai IND 2014.12.30
Kingside castling is forbidden to White.
File mb-2017.pgn: Line number: 43241006
Failed to make move 18. O-O in the game:
r.b..b.r
p...k..p
.p..p.p.
..p.....
....NP..
...N....
PP.P..PP
R.B.K..R

Cavendish, Joshua Z - Jones, G London Classic Superrapid London ENG 2015.12.12
Kingside castling is forbidden to Black.
File mb-2017.pgn: Line number: 44425414
Failed to make move 29... O-O in the game:
...bk..r
pQ...p..
.n....pp
...pN...
...P....
.P.....P
P....KP.
........

Costachi, M - Vasiliu, Cosmin 26th Seacoast Trophy 2016 Eforie Nord ROU 2016.07.01
Games: 2919420


Still, with 2,919,420 games it is a remarkably clean data set.
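
For anyone who wants to locate the offending tag lines before repairing them, here is a minimal Python sketch. It is not what the eco tool above does internally. The regular expression and the function name are my own, and it assumes each tag pair sits alone on a line and that the fault is simply a missing closing quote. Movetext lines that happen to begin with "[" would be flagged as well.

```python
import re

# [Name "value"] with backslash escapes allowed inside the value
TAG_PAIR = re.compile(r'^\[\s*([A-Za-z0-9_]+)\s+"((?:[^"\\]|\\.)*)"\s*\]\s*$')

def malformed_tag_lines(pgn_text):
    """Return (line_number, line) for lines that start like a tag pair but do not parse."""
    bad = []
    for number, line in enumerate(pgn_text.splitlines(), start=1):
        stripped = line.strip()
        if stripped.startswith("[") and not TAG_PAIR.match(stripped):
            bad.append((number, line))
    return bad

sample = '[Event "Munich 30]\n[Site "Munich 30"]\n'
print(malformed_tag_lines(sample))
```

The line numbers returned can be compared with the ones a PGN tool reports, so each hit can be checked by hand.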
Ferdy
Posts: 4833
Joined: Sun Aug 10, 2008 3:15 pm
Location: Philippines

Re: MillionBase 2017

Post by Ferdy »

pgn-extract also reports some warnings.


Warning: Result of 1/2-1/2 is inconsistent with checkmate by black in
Ahmedzade, A. - Zarubica, L. Herculane ch-EUw U14 Herculane ch-EUw U14 1994.01.01
File mb-2017.pgn: Line number: 136244
Warning: Result of 1-0 is inconsistent with checkmate by black in
Ankerst, Milka - Demina, Julia Palma de Mallorca Palma de Mallorca 1992.01.01
File mb-2017.pgn: Line number: 410128
Warning: Result of 1-0 is inconsistent with checkmate by black in
Atlas, D. - Oratovsky, M. Cappelle la Grande open Cappelle la Grande open 1996.01.01
File mb-2017.pgn: Line number: 579294
Warning: Result of 1-0 is inconsistent with checkmate by black in
Birens, E. - Anselmo, A. Hyeres op Hyeres op 1992.01.01
File mb-2017.pgn: Line number: 1276145
Warning: Result of 1-0 is inconsistent with checkmate by black in
Bodini - Biggi corr Italy corr Italy 1969.01.01
File mb-2017.pgn: Line number: 1406286
Warning: Result of 0-1 is inconsistent with checkmate by white in
Bojadgieva, Rumiana - Polihroniade, E. Veliko Trnovo wzt Veliko Trnovo wzt 1985.01.01
File mb-2017.pgn: Line number: 1463979
Warning: Result of 1-0 is inconsistent with checkmate by black in
Broek, T. - Siekerman, P. Alkmaar open Alkmaar open 1982.01.01
File mb-2017.pgn: Line number: 1719619
Warning: Result of 1-0 is inconsistent with checkmate by black in
Capece - Pederzoli Reggio Emilia tt Reggio Emilia tt 1972.01.01
File mb-2017.pgn: Line number: 2006378
Warning: Result of 0-1 is inconsistent with checkmate by white in

[more ...]
One example.
[pgn]
[Event "Palma de Mallorca"]
[Site "Palma de Mallorca"]
[Date "1992.01.01"]
[Round "1"]
[White "Ankerst, Milka"]
[Black "Demina, Julia"]
[Result "1-0"]
[ECO "B57"]
[PlyCount "68"]
[EventDate "1992.??.??"]

1. e4 c5 2. Nf3 Nc6 3. d4 cxd4 4. Nxd4 Nf6 5. Nc3 d6 6. Bc4 Qb6 7. Nb3 e6 8.
Be3 Qc7 9. f3 Be7 10. Qd2 a6 11. O-O-O O-O 12. g4 Ne5 13. Be2 b5 14. g5 Nfd7
15. f4 Nc4 16. Qd3 Nxe3 17. Qxe3 b4 18. Nb1 a5 19. Nd4 Ba6 20. Bxa6 Rxa6 21.
Rhg1 Rc8 22. Rg2 a4 23. a3 Qc5 24. h4 Rb6 25. axb4 Qxb4 26. Qa3 Qa5 27. Nd2 Bf8
28. f5 e5 29. Ne2 d5 30. Qa2 Qc5 31. c3 a3 32. Kb1 Qb5 33. Nf1 axb2 34. Qxb2
Qxb2# 1-0
[/pgn]
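
The kind of check behind this warning can be sketched in a few lines of Python. This is not pgn-extract's implementation, and the function names are invented for the example. It assumes the game starts from the normal position with White to move, has no variations in parentheses, and marks mate with "#" on the last move.

```python
import re

RESULTS = {"1-0", "0-1", "1/2-1/2", "*"}
EXPECTED_RESULT = {"white": "1-0", "black": "0-1"}

def mating_side(movetext):
    """Return 'white' or 'black' if the final move ends in '#', otherwise None."""
    text = re.sub(r"\{[^}]*\}", " ", movetext)   # drop {comments}
    text = re.sub(r"\d+\.+", " ", text)           # drop move numbers like 12. or 12...
    moves = [t for t in text.split()
             if t not in RESULTS and not t.startswith("$")]  # drop result and NAGs
    if not moves or not moves[-1].endswith("#"):
        return None
    # Odd ply count: White made the last move; even: Black did.
    return "white" if len(moves) % 2 == 1 else "black"

def result_contradicts_mate(result, movetext):
    """True if the game ends in mate but the Result tag names a different outcome."""
    side = mating_side(movetext)
    return side is not None and result != EXPECTED_RESULT[side]

# Black delivers mate, yet the result claims a White win.
print(result_contradicts_mate("1-0", "1. f3 e5 2. g4 Qh4# 1-0"))
```

In the example game above the last move is 34...Qxb2#, the 68th ply, so Black gave mate and a result of 1-0 is flagged.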
jdart
Posts: 4367
Joined: Fri Mar 10, 2006 5:23 am
Location: http://www.arasanchess.org

Re: MillionBase 2017

Post by jdart »

For the first one you might have the wrong score. I do not see any illegal castling here (or maybe it is a different game - this is the only Kulaots-Janocha in my db?):

[Event "EU-ch rapid"]
[Site "Wroclaw"]
[Date "2014.12.21"]
[Round "8"]
[White "Kulaots, Kaido"]
[Black "Janocha, Wieslaw Adam"]
[Result "1/2-1/2"]
[ECO "D11"]
[WhiteElo "2563"]
[BlackElo "2269"]
[PlyCount "34"]
[EventDate "2014.12.20"]
[EventType "swiss (rapid)"]
[EventRounds "11"]
[EventCountry "POL"]
[SourceTitle "CBM 164 Extra"]
[Source "ChessBase"]
[SourceDate "2015.03.02"]
[SourceVersion "1"]
[SourceVersionDate "2015.03.02"]
[SourceQuality "1"]

1. d4 d5 2. c4 c6 3. Nf3 Nf6 4. Qc2 g6 5. Bf4 Bf5 6. Qb3 Qb6 7. e3 Qxb3 8. axb3
Bxb1 9. Rxb1 e6 10. c5 Nbd7 11. h3 Be7 12. Ra1 O-O 13. Nd2 Nh5 14. Bh2 e5 15.
b4 exd4 16. exd4 Bf6 17. Nb3 Ng7 1/2-1/2
MikeGL
Posts: 1010
Joined: Thu Sep 01, 2011 2:49 pm

Re: MillionBase 2017

Post by MikeGL »

I checked with MillionBase: its version is one move longer than the game posted above
and ends on move 18.

[Event "European Rapid 2014"]
[Site "Wroclaw POL"]
[Date "2014.12.21"]
[Round "8.41"]
[White "Kulaots, K"]
[Black "Janocha, W"]
[Result "1/2-1/2"]
[WhiteElo "2563"]
[BlackElo "2269"]
[ECO "D11"]
[EventDate "2014.12.20"]
[WhiteTitle "GM"]
[BlackTitle "FM"]
[WhiteFideId "4500261"]
[BlackFideId "1100548"]

1.d4 d5 2.c4 c6 3.Nf3 Nf6 4.Qc2 g6 5.Bf4 Bf5 6.Qb3 Qb6 7.e3 Qxb3 8.axb3
Bxb1 9.Rxb1 e6 10.c5 Nbd7 11.h3 Be7 12.Ra1 O-O 13.Nd2 Nh5 14.Bh2 e5 15.b4
exd4 16.exd4 Bf6 17.Nb3 Ng7 18.O-O-O Ne6 1/2-1/2