Optimizing generative AI by backpropagating language model feedback – Nature

Brown, T. et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33, 1877–1901 (2020).

Trinh, T. H., Wu, Y., Le, Q. V., He, H. & Luong, T. Solving olympiad geometry without human demonstrations. Nature 625, 476–482 (2024).

Article
ADS
CAS
PubMed
PubMed Central

Google Scholar

Li, Y. et al. Competition-level code generation with alphacode. Science 378, 1092–1097 (2022).

Article
ADS
CAS
…

Article Source
https://www.nature.com/articles/s41586-025-08661-4

Related Posts