@timestatic

timestatic@feddit.org · 1 day ago

Oh I actually just switched it up accidentally while typing. I read it right but still almost one out of three doesn’t get it

timestatic@feddit.org · 2 days ago

I mean if they fix specific reasoning test answers (like the strawberry one) this doesn’t actually make reasoning better tho. It just optimizes for benchmarks

timestatic@feddit.org · 2 days ago

Yeah seems like the training on human data makes it so most AIs will answer at least as unreliable as humans. 71% saying walk from the human side is crazy

timestatic@feddit.org · 3 days ago

Or one of the two collapses and the other one assumes power. Taiwan could concede which I don’t hope they do but technically its possible.