ResearchExploratory Analysis of TRLX RLHF TransformersEleutherAI Blog·April 2, 2023·medium confidenceWhy it matters→This analysis contributes to understanding the interpretability of RLHF models, which is crucial for AI practitioners working with complex transformer architectures.©EleutherAI BlogThe EleutherAI Blog presents a demonstration of interpretability for RLHF (Reinforcement Learning from Human Feedback) models using TransformerLens.Read original
© MIT News AIInvestment · $97 millionResearchotherBeacon Biosignals maps brain activity during sleepBeacon Biosignals is developing a headband to monitor brain activity during sleep, using machine learning to analyze data for neurological disorders. The company recently raised $97 million to expand its platform and clinical trials.MIT News AI·May 1, 2026
© Microsoft ResearchResearchagentsRed-teaming AI agent networks reveals new vulnerabilitiesMicrosoft Research explores vulnerabilities in networks of AI agents, highlighting risks that emerge only through interaction. Their tests reveal how malicious messages can propagate and manipulate agent behavior.Microsoft Research·Apr 30, 2026