Senior Audrey Lorvo is researching AI safety, which seeks to ensure increasingly intelligent AI models are reliable and can benefit humanity. The growing field focuses on technical challenges like robustness and AI alignment with human values, as well as societal concerns like transparency and accountability. Practitioners are also concerned with the potential existential risks associated with increasingly powerful AI tools.
“Ensuring AI isn’t misused or acts…
Article Source
https://news.mit.edu/2025/audrey-lorvo-aligning-ai-human-values-0204