Practically Intelligence E7: The Power of Benchmarking in AI Progress

Praveen Paritosh, a visionary in the field of AI research, discusses the history of shared data benchmarks and how they guide AI research; legacy benchmarks like SQuAD and their influence on the AI community; the complexity of benchmarks as proxies; and the fine line between conceptual and rote learning in AI development.