Mousehub

mal099 commented on Wilds of Eldraine | Episode 1: Pure of Heart • •

It's really neat that both of these have actual full audio versions this time, I appreciate them doing that!

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

True, GPT does not return a "yes" or "no" 100% of the time in either case, but that's not the point. The point is that it's impossible to say if GPT has actually gotten better or worse at predicting prime numbers with their test set. Since the test set is composed of only prime numbers, we do not know if GPT is more likely to call a number "prime" when it actually is a prime number than when it isn't. All we know is that it was very likely to answer "yes" to the question "is this number prime?" in March, and very likely to answer "no" in July. We do not know if the number makes a difference.

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

@rastilin is making some unproven assumptions here. But it is true that the "math question" dataset consists only of prime numbers, so if the first version thought every number was prime and the second thought no numbers were prime, we would see this exact behavior. Source:

For this dataset, we query the primality of 500 randomly chosen primes between 1,000 and 20,000; the correct answer is always Yes.

From Zhang et al. (2023), the paper they took the dataset from.

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

Damn, you're right. The study has not been peer reviewed yet according to the article, and in my opinion, it really shows. For anyone who doesn't want to actually read the study:

They took the set of questions from a different study (which is fine). The original study had a set of 500 randomly chosen prime numbers and asked ChatGPT if they were prime, and to support its reasoning. They did this to see if in the cases where ChatGPT got the question wrong, ChatGPT would try to support its wrong answer with more faulty reasoning - a dataset with only prime numbers is perfectly fine for this initial question.

The study in the article appears to be trying to answer two questions - is there significant drift in the answers ChatGPT gives, and is ChatGPT getting better or worse at answering questions. The dataset is perfectly fine for answering the first question, but completely inadequate for answering the second, since an AI that simply thinks all numbers are prime would be judged as having perfect accuracy! Some good peer review would never let that kind of thing slide.

mal099 commented on [CMM] (NEW) Battle at the Helvault • •

Can exile your own stuff, so neat synergy with blink decks!

mal099 commented on [CMM] (NEW) Rukarumel, Biologist • •

Could be fun with mercenaries or rebels. Just tutor for whatever you want.

mal099 commented on preach • •

I would steal this argument, but if it can be reposted here for free, then I don't think anybody really owns it. 🤔

mal099 commented on 14 December 1985 • •

I think it's called "Buckles" in English. https://en.wikipedia.org/wiki/Buckles_(comics)
"Chiffon" in French.

mal099 commented on Go visit the mtgzone.com communities! • •

I'm getting an error 404 from your link (on kbin). Does this work?
Edit: Works for me at least!

Last page

mal099

mal099 commented on Wilds of Eldraine | Episode 1: Pure of Heart • mtg •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • tech •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • tech •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • tech •

mal099 commented on [CMM] (NEW) Battle at the Helvault • spoilers •

mal099 commented on [CMM] (NEW) Rukarumel, Biologist • spoilers •

mal099 commented on preach • piracy •

mal099 commented on 14 December 1985 • calvinandhobbes •

mal099 commented on Go visit the mtgzone.com communities! • magictcg •

mal099 commented on Wilds of Eldraine | Episode 1: Pure of Heart • mtg •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • tech •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • tech •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • tech •

mal099 commented on [CMM] (NEW) Battle at the Helvault • spoilers •

mal099 commented on [CMM] (NEW) Rukarumel, Biologist • spoilers •

mal099 commented on preach • piracy •

mal099 commented on 14 December 1985 • calvinandhobbes •

mal099 commented on Go visit the mtgzone.com communities! • magictcg •

mal099

mal099 commented on Wilds of Eldraine | Episode 1: Pure of Heart • •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

mal099 commented on [CMM] (NEW) Battle at the Helvault • •

mal099 commented on [CMM] (NEW) Rukarumel, Biologist • •

mal099 commented on preach • •

mal099 commented on 14 December 1985 • •

mal099 commented on Go visit the mtgzone.com communities! • •

mal099 commented on Wilds of Eldraine | Episode 1: Pure of Heart • •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

mal099 commented on Stanford Scientists Find That Yes, ChatGPT Is Getting Stupider • •

mal099 commented on [CMM] (NEW) Battle at the Helvault • •

mal099 commented on [CMM] (NEW) Rukarumel, Biologist • •

mal099 commented on preach • •

mal099 commented on 14 December 1985 • •

mal099 commented on Go visit the mtgzone.com communities! • •