Описание
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About this Role This role offers a unique opportunity to shape how AI systems handle sensitive chemical and explosives information. You'll work with leading AI safety researchers while tackling critical problems in preventing catastrophic misuse. If you're excited about using your expertise to ensure AI systems remain safe and beneficial, we want to hear from you.
Responsibilities
Design and implement evaluation methodologies for assessing AI model capabilities relevant to chemical weapons, explosives synthesis, and energetic materials Develop and execute strategies to identify and mitigate potential C/E misuse in model outputs Create C/E threat models, including precursor identification, synthesis routes, and weaponization techniques Review and analyze traffic to identify potential policy violations related to C/E content Collaborate with software engineers to develop and refine detection systems and automated enforcement tools for C/E threats Conduct rapid response to escalations involving dangerous C/E queries Collaborate across teams to establish safety benchmarks and develop appropriate model guardrails Translate C/E domain knowledge into actionable safety requirements Develop approaches to assess
Контакты работодателя (email/phone/telegram) скрыты из публичного превью —
отправьте резюме, чтобы мы связали вас напрямую.