Testing LLM performance on the Physics GRE: some observations

December 07, 2023 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Pranav Gupta arXiv ID 2312.04613 Category physics.ed-ph Cross-listed cs.LG Citations 3 Venue arXiv.org Last Checked 1 month ago

Abstract

With the recent developments in large language models (LLMs) and their widespread availability through open source models and/or low-cost APIs, several exciting products and applications are emerging, many of which are in the field of STEM educational technology for K-12 and university students. There is a need to evaluate these powerful language models on several benchmarks, in order to understand their risks and limitations. In this short paper, we summarize and analyze the performance of Bard, a popular LLM-based conversational service made available by Google, on the standardized Physics GRE examination.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — physics.ed-ph

R.I.P. 👻 Ghosted

Unreflected Acceptance -- Investigating the Negative Consequences of ChatGPT-Assisted Problem Solving in Physics Education

Lars Krupp, Steffen Steinert, ... (+6 more)

physics.ed-ph 🏛 HHAI 📚 47 cites 2 years ago

R.I.P. 👻 Ghosted

Use of Eye-Tracking Technology to Investigate Cognitive Load Theory

Tianlong Zu, John Hutson, ... (+2 more)

physics.ed-ph 🏛 arXiv 📚 30 cites 8 years ago

R.I.P. 👻 Ghosted

Beyond Answers: Large Language Model-Powered Tutoring System in Physics Education for Deep Learning and Precise Understanding

Zhoumingju Jiang, Mengjun Jiang

physics.ed-ph 🏛 arXiv 📚 11 cites 1 year ago

R.I.P. 👻 Ghosted

How Peripheral Interactive Systems Can Support Teachers with Differentiated Instruction: Using FireFlies as a Probe

Nine Sellier, Pengcheng An

physics.ed-ph 🏛 DIS 📚 8 cites 5 years ago

R.I.P. 👻 Ghosted

Combining surveys and sensors to explore student behaviour

Inkeri Kontro, Mathieu Génois

physics.ed-ph 🏛 Education sciences 📚 6 cites 6 years ago

R.I.P. 👻 Ghosted

Innovative Approaches to Teaching Quantum Computer Programming and Quantum Software Engineering

Majid Haghparast, Enrique Moguel, ... (+3 more)

physics.ed-ph 🏛 QCE 📚 6 cites 1 year ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago