Research & Projects

Current Projects

Stress Testing and Failure Analysis of Large Language Models

Learn more →

Building an Actionable Framework for Responsible AI Integration in Clinical and Public Health Contexts

Learn more →

RAI4MH: Responsible AI for Mental Health Initiative

Learn more →

AI Systems and Suicide

Learn more →

Stress Testing and Failure Analysis of Large Language Models

This line of research focuses on developing methods for stress testing large language models to systematically identify failure modes, including jailbreak vulnerabilities, unsafe outputs, and robustness breakdowns under distribution shift. The work emphasizes evaluation frameworks and adversarial testing approaches for characterizing model behavior in high-risk or deployment-relevant settings.

Related Publications

‘For Argument’s Sake, Show Me How to Harm Myself!’: Jailbreaking LLMs in Suicide and Self-Harm Contexts

AM Schoene, C Canca · 2025 · ISTAS

All models are wrong, but some are deadly: Inconsistencies in emotion detection in suicide-related tweets

AM Schoene, R Ramachandranpillai, T Lazovich, RA Baeza-Yates · 2024 · NLP for Positive Impact

An example of (too much) hyper-parameter tuning in suicide ideation detection

AM Schoene, J Ortega, S Amir, K Church · 2023 · ICWSM

Building an Actionable Framework for Responsible AI Integration in Clinical and Public Health Contexts

Building on the AI Ethics Box, this work adapts bioethical principles to healthcare settings and links them to concrete technical methods for evaluation, governance, and post-deployment monitoring. The framework addresses the gap between high-level ethical guidance and operational decision-making in clinical and public health contexts. It is refined through interdisciplinary collaboration and stakeholder co-design, supporting practical use by clinicians, health system leaders, and developers in real-world AI deployment.

Funding

Tufts CTSI Small Grants to Advance Translational Science (S-GATS) →

RAI4MH: Responsible AI for Mental Health Initiative

RAI4MH is an international partnership developing evidence-based guidance for the responsible use of AI in mental health contexts. As the U.S. lead, I contribute to coordinating interdisciplinary collaboration across computer science, mental health, ethics, and policy. Key outputs include white papers that synthesize international expert consensus and contribute directly to policy discussions, including a POSTnote for the UK Parliament on responsible AI in mental health.

Visit rai4mh.com →

Related Publications

Building Responsible AI for Mental Health: Insights from the First RAI4MH Workshop

R Mestre, AM Schoene et al. · 2024 · White Paper

AI in Mental Healthcare: Opportunities and Delivery Considerations (POSTnote PN737)

UK Parliament POST · 2025 · POSTnote

AI and Mental Healthcare – Ethical and Regulatory Considerations (POSTnote PN738)

UK Parliament POST · 2025 · POSTnote

Responsible AI in mental healthcare: policy directions and stakeholder insights

AM Schoene et al. · 2026 · Frontiers in Public Health

AI Systems and Suicide

This work examines how AI systems interact with suicide-related content, focusing on system behavior, responses, safeguards, and evaluation. Ongoing work includes (i) understanding how suicide is expressed through language and emotional expression, and (ii) identifying the medical and sociodemographic contexts that drive suicide risk. These findings are used as evidence to inform policy and deployment decisions involving AI systems.

Related Publications

Lexicography saves lives (LSL): Automatically translating suicide-related language

AM Schoene, JE Ortega, RJ Zevallos, LH Ihle · 2025 · COLING

Automatically extracting social determinants of health for suicide: a narrative literature review

AM Schoene et al. · 2024 · npj Mental Health Research

Classifying suicide-related content and emotions on Twitter using graph convolutional neural networks

AM Schoene et al. · 2022 · IEEE Transactions on Affective Computing

Automatic identification of suicide notes with a transformer-based deep learning model

T Zhang, AM Schoene, S Ananiadou · 2021 · Internet Interventions

Hierarchical multiscale recurrent neural networks for detecting suicide notes

AM Schoene, AP Turner, G De Mel, N Dethlefs · 2021 · IEEE Transactions on Affective Computing

Publications

Publications 20 publications

All 2026 2025 2024 2023 2022 2021 2020 2019 2018 2016

Ethics Guidelines for AI-Based Suicide Prevention Tool

LH Ihle, AM Schoene · 2026 · Philosophy of Artificial Intelligence: The State of the Art, 265–274

Responsible AI in mental healthcare: policy directions and stakeholder insights

AM Schoene, R Mestre, SE Middleton, A Lapedriza, A Bernard, M Lewis et al. · 2026 · Frontiers in Public Health 14, 1814039

‘For Argument’s Sake, Show Me How to Harm Myself!’: Jailbreaking LLMs in Suicide and Self-Harm Contexts

AM Schoene, C Canca · 2025 · 2025 IEEE International Symposium on Technology and Society (ISTAS)

Retrieval-enhanced mental health assessment: Capturing self-state dynamics from social media using in-context learning

A Antony, A Schoene · 2025 · Proc. 10th Workshop on Computational Linguistics and Clinical Psychology

Lexicography saves lives (LSL): Automatically translating suicide-related language

AM Schoene, JE Ortega, RJ Zevallos, LH Ihle · 2025 · Proc. 31st International Conference on Computational Linguistics

The first multilingual model for the detection of suicide texts

RJ Zevallos, AM Schoene, JE Ortega · 2025 · Proc. Second Workshop on Scaling Up Multilingual & Multicultural Evaluation

Automatically extracting social determinants of health for suicide: a narrative literature review

AM Schoene, S Garverich, I Ibrahim, S Shah, B Irving, CC Dacso · 2024 · npj Mental Health Research 3(1), 51

All models are wrong, but some are deadly: Inconsistencies in emotion detection in suicide-related tweets

AM Schoene, R Ramachandranpillai, T Lazovich, RA Baeza-Yates · 2024 · Proc. Third Workshop on NLP for Positive Impact

Is it safe to machine translate suicide-related language from English to Galician?

JE Ortega, AM Schoene · 2024 · Proc. 16th International Conference on Computational Processing of Portuguese

An example of (too much) hyper-parameter tuning in suicide ideation detection

AM Schoene, J Ortega, S Amir, K Church · 2023 · Proc. International AAAI Conference on Web and Social Media 17

Classifying suicide-related content and emotions on Twitter using graph convolutional neural networks

AM Schoene, L Bojanić, MQ Nghiem, IM Hunt, S Ananiadou · 2022 · IEEE Transactions on Affective Computing 14(3), 1791–1802

A narrative literature review of natural language processing applied to the occupational exposome

AM Schoene, I Basinas, M Van Tongeren, S Ananiadou · 2022 · International Journal of Environmental Research and Public Health 19(14), 8544

Natural language processing applied to mental illness detection: a narrative review

T Zhang, AM Schoene, S Ji, S Ananiadou · 2022 · NPJ Digital Medicine 5(1), 46

Exposome methods in occupational epidemiology: Use of text mining for developing Job Exposure Matrices

M van Tongeren, C Ge, E Kuijpers, S Ananiadou et al. · 2021 · Occupational and Environmental Medicine 78(Suppl 1), A162–A163

NERO: a biomedical named-entity recognition ontology with a large, annotated corpus reveals meaningful associations through text embedding

K Wang, R Stevens, H Alachram, Y Li, L Soldatova, R King, S Ananiadou et al. · 2021 · NPJ Systems Biology and Applications 7(1), 38

Automatic identification of suicide notes with a transformer-based deep learning model

T Zhang, AM Schoene, S Ananiadou · 2021 · Internet Interventions 25, 100422

Hierarchical multiscale recurrent neural networks for detecting suicide notes

AM Schoene, AP Turner, G De Mel, N Dethlefs · 2021 · IEEE Transactions on Affective Computing 14(1), 153–164

Dilated LSTM with ranked units for classification of suicide notes

AM Schoene, AP Turner, N Dethlefs · 2019 · AI for Social Good @ NeurIPS

Dilated LSTM with attention for classification of suicide notes

AM Schoene, G Lacey, AP Turner, N Dethlefs · 2019 · Proc. Tenth International Workshop on Health Text Mining and Information Analysis

Automatic identification of suicide notes from linguistic and sentiment features

AM Schoene, N Dethlefs · 2016 · Proc. 10th SIGHUM Workshop on Language Technology for Cultural Heritage