Can LLMs Handle the Great Depression? Evaluating Economic Reasoning via GDPVal




Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • Welcome!
  • Can we really detect LLM-Generated Text ?
  • R squared in Machine Learning
  • Training a simple bigram character level model on tiny stories