Too difficult for computers - Chess Forums - Page 6

Elroch

Oct 13, 2023

0

#101

Yeah, it's not very informative to let it analyse after you have told it the answer, just like if you did the same to a human. It uses what you tell it as part of the analysis, which helps enormously.

EndgameEnthusiast2357

Oct 13, 2023

0

#102

But the m41 appeared when I was first entering the moves, on its own, not from going back through the moves.

Elroch

Oct 13, 2023

0

#103

It will use whatever you have put in during the current session, no more.

EndgameEnthusiast2357

Oct 14, 2023

0

#104

Here's a simple endgame for humans to understand but stockfish has trouble with. However stockfish sometimes still gives the correct moves despite not declaring mates throughout:

Elroch

Oct 14, 2023

0

#105

"Simple" is probably not the word! "Possible" would be more appropriate!

Humans think in a different way. One key is we think of all the squares the knight can reach rather than all the positions that can be reached. The computer can only deal with positions where the two king positions and the knight position are specified. But that should not be too bad for it - without a pawn moving that is merely tens of thousands of positions. Part of the problem is the weakness of Stockfish at deep sacrifices. AIs may be better for this (while weaker at tactics). It's low priority for a key move that will be a problem. Here, it has to look at one sacrifice (in many king positions) work out that accepting it works out and then find the route to the second sacrifice and calculate the results after an acceptance or decline (semantically interesting that the noun for the declining of a sacrifice seems kind of odd).

EndgameEnthusiast2357

Oct 14, 2023

0

#106

Computers also seem to struggle in closed positions.

Arisktotle

Oct 14, 2023

0

#107

EndgameEnthusiast2357 wrote:

Computers also seem to struggle in closed positions.

Chess.com's new onboard engine Torch solved #109 in about 5 minutes! After resetting my computer which supposedly emptied the cash. But it is always possible that an engine keep a backup of the most recent analyses on the server!

EndgameEnthusiast2357

Oct 14, 2023

0

#108

Well actually stockfish gave most of the right moves, but just didn't declare it as immediately winning. The delay seems to be in the evaluation of the position even if it suggests the best moves or some of the best moves.

Arisktotle

Oct 14, 2023

0

#109

EndgameEnthusiast2357 wrote:

Well actually stockfish gave most of the right moves, but just didn't declare it as immediately winning. The delay seems to be in the evaluation of the position even if it suggests the best moves or some of the best moves.

That's only natural considering it is not a mate-in-n puzzle. Torch too, only reported a +4.6 score but that would probably indicate it saw the win as it started out with only +1.4 to reflect white's material superiority.

EndgameEnthusiast2357

Oct 14, 2023

0

#110

Yeah but if the mate would come within a small number of moves after, such after a pawn breaking through, it usually just adds that to the m long before. For example when I had stockfish play against itself in a queen vs Rook endgame, it gave m20+ long before the Rook was captured (since it is always a max of m10 with a queen vs lone king).

Arisktotle

Oct 14, 2023

0

#111

Not sure I understand your point. Anyway, I finished the analysis with just one display line and I screened it to the point where the knight reached the critical square f5. All very trustworthy!

Arisktotle

Oct 15, 2023

0

#112

Ah, I see your point. You are still on the trail of delivering mathematical proof which requires delivering checkmate - while the criteria for accepting "wins" in the composition environment are much softer. They will accept not just mates but tablebase endings, endgame theory and high engine evaluation ratings (+3 is sufficient) as satisfactory evidence. Plus evals by human experts. Of course, when you are testing an engine - especially a new one like Torch - you'd need a calibration system to verify the validity of its evaluation scores. Otherwise it's just Torch saying that Torch is winning! Or as we say in Holland "Toilet Duck recommends Toilet Duck" in the category of funniest (real) commercials ever.

The composition community is happy to take chances with top-grade evidence like medical science on the basis that "time will tell" and no one will die from an incidental lapse in the eval systems

Elroch

Oct 15, 2023

0

#113

Tablebase/endgame theory (should be basically same thing) is fine.

+3 evaluation is inadequate evidence for a good problem, IMHO. An AI would likely leave this significantly short of 100% chance of the result in many cases. Sure, you'll mostly be right. And sometimes you'll be wrong.

Dodi232

Oct 15, 2023

0

#114

Thats a really difficult one

Arisktotle

Oct 15, 2023

0

#115

Elroch wrote:

Tablebase/endgame theory (should be basically same thing) is fine.

+3 evaluation is inadequate evidence for a good problem, IMHO. An AI would likely leave this significantly short of 100% chance of the result in many cases. Sure, you'll mostly be right. And sometimes you'll be wrong.

In compositions there is an essential difference between "studies" and "problems". The latter do require proof though it's clear that such is not always attained - especially with complex modern constructions. The judgement of the former accepts a concoction of evidence which falls short of proof. That is why so many endgames were found cooked and seriously dualed when the engines rushed in - not just on tablebases. I found a number of flaws myself.

The idea of the +3 score comes in for the "loose ends" often diagnosed and selected by human experts and the composer himself. Commonly quiet endgame positions (e.g. a rook with 2 pawns against a well-placed knight with 3 pawns) which are near the draw/win divide. Many endgame studies have such endings in or besides the main lines which require reliable evaluations - often to eliminate unwanted duals. In previous centuries composers provided many pages of analysis to convince judges of their viewpoint. Today they'd rather hand that job to a strong engine or by cooperating with one to explore the darkest corners.

But the ultimate defense is in the "reversal of the burden of proof". After engines and human testers have done their job, the endgames are published and even prized and still everyone is invited till the end of times to apply all their resources to find the glitches that were missed. No verdict is final until some electronic beast in some future has confirmed or refuted it by some full proof analytical method.

Interestingly I am viewed as a composer of complex endgame studies though not many. Still today I am fearful that one of those might fall under the sword of an eager kid with a supercomputer. So sad

Mordicus1973

Oct 15, 2023

0

#116

Elroch a écrit :

Stockfish found another line for mate in 19 :

1.Be3+ Kb1 2.Bh6 b5 3.Ke7 b4 4.Kf6 b5 5.Kg5 Kc1 6.Kf5+ 6.Kg4+ Kb1 7.Kf4 Kc1 8.Kf3+ Kb1 9.Ke3 Kc1 10.Kd3+ Kb1 11.Be6 b3 12.Bxb3 b4 13.Bd5 b3 14.Kc3 Ka2 15.Bxb3+ Kb1 16.Bf4 h6 17.Bc4 h5 18.Kb3 h4 19.Bd3#

GMegasDoux

Oct 16, 2023

0

#117

That was amazing.

EndgameEnthusiast2357

Oct 27, 2023

0

#118

Forgive me if I posted this before, but it may have been a different thread. It's called the "kings grave" puzzle:

The computer has trouble seeing this one until about 8 moves away. I know not all the moves are forced, but the queen can always easily reposition to force the king to the a file in every case (too many to include every variation, but have checked them with stockfish, it's a forced mate even if the king runs back to the 3rd rank as he is cut off on the 4th rank anyway)

Elroch

Oct 28, 2023

0

#119

Delightful artificiality!

EndgameEnthusiast2357

Oct 28, 2023

0

#120

What's interesting is that stockfish instantly solves the simpler 12 move version of this Cascade-Underpromotion (below), but can't solve the 19 move one above with the few extra moves interspersed.