Optimizing generative AI by backpropagating language model feedback – Nature

  • Brown, T. et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33, 1877–1901 (2020).

  • Trinh, T. H., Wu, Y., Le, Q. V., He, H. & Luong, T. Solving olympiad geometry without human demonstrations. Nature 625, 476–482 (2024).

  • Li, Y. et al. Competition-level code generation with AlphaCode. Science 378, 1092–1097 (2022).

  • Article Source
    https://www.nature.com/articles/s41586-025-08661-4