• Abstracts: NeurIPS 2024 with Jindong Wang and Steven Euijong Whang

  • Dec 13 2024
  • Length: 12 mins
  • Podcast

Abstracts: NeurIPS 2024 with Jindong Wang and Steven Euijong Whang

  • Summary

  • Researcher Jindong Wang and Associate Professor Steven Euijong Whang explore the NeurIPS 2024 work ERBench. ERBench leverages relational databases to create LLM benchmarks that can verify model rationale via keywords in addition to checking answer correctness.

    Read the paper

    Get datasets and codes

    Show More Show Less
activate_Holiday_promo_in_buybox_DT_T2

What listeners say about Abstracts: NeurIPS 2024 with Jindong Wang and Steven Euijong Whang

Average customer ratings

Reviews - Please select the tabs below to change the source of reviews.