Current Projects

×

Stress Testing and Failure Analysis of Large Language Models

This line of research focuses on developing methods for stress testing large language models to systematically identify failure modes, including jailbreak vulnerabilities, unsafe outputs, and robustness breakdowns under distribution shift. The work emphasizes evaluation frameworks and adversarial testing approaches for characterizing model behavior in high-risk or deployment-relevant settings.

Related Publications
All models are wrong, but some are deadly: Inconsistencies in emotion detection in suicide-related tweets
AM Schoene, R Ramachandranpillai, T Lazovich, RA Baeza-Yates · 2024 · NLP for Positive Impact
An example of (too much) hyper-parameter tuning in suicide ideation detection
AM Schoene, J Ortega, S Amir, K Church · 2023 · ICWSM
×

Building an Actionable Framework for Responsible AI Integration in Clinical and Public Health Contexts

Building on the AI Ethics Box, this work adapts bioethical principles to healthcare settings and links them to concrete technical methods for evaluation, governance, and post-deployment monitoring. The framework addresses the gap between high-level ethical guidance and operational decision-making in clinical and public health contexts. It is refined through interdisciplinary collaboration and stakeholder co-design, supporting practical use by clinicians, health system leaders, and developers in real-world AI deployment.

×

RAI4MH: Responsible AI for Mental Health Initiative

RAI4MH is an international partnership developing evidence-based guidance for the responsible use of AI in mental health contexts. As the U.S. lead, I contribute to coordinating interdisciplinary collaboration across computer science, mental health, ethics, and policy. Key outputs include white papers that synthesize international expert consensus and contribute directly to policy discussions, including a POSTnote for the UK Parliament on responsible AI in mental health.

Visit rai4mh.com →
×

AI Systems and Suicide

This work examines how AI systems interact with suicide-related content, focusing on system behavior, responses, safeguards, and evaluation. Ongoing work includes (i) understanding how suicide is expressed through language and emotional expression, and (ii) identifying the medical and sociodemographic contexts that drive suicide risk. These findings are used as evidence to inform policy and deployment decisions involving AI systems.

Related Publications
Lexicography saves lives (LSL): Automatically translating suicide-related language
AM Schoene, JE Ortega, RJ Zevallos, LH Ihle · 2025 · COLING
Classifying suicide-related content and emotions on Twitter using graph convolutional neural networks
AM Schoene et al. · 2022 · IEEE Transactions on Affective Computing
Automatic identification of suicide notes with a transformer-based deep learning model
T Zhang, AM Schoene, S Ananiadou · 2021 · Internet Interventions
Hierarchical multiscale recurrent neural networks for detecting suicide notes
AM Schoene, AP Turner, G De Mel, N Dethlefs · 2021 · IEEE Transactions on Affective Computing

Publications

Publications 20 publications
Ethics Guidelines for AI-Based Suicide Prevention Tool
LH Ihle, AM Schoene · 2026 · Philosophy of Artificial Intelligence: The State of the Art, 265–274
Responsible AI in mental healthcare: policy directions and stakeholder insights
AM Schoene, R Mestre, SE Middleton, A Lapedriza, A Bernard, M Lewis et al. · 2026 · Frontiers in Public Health 14, 1814039
‘For Argument’s Sake, Show Me How to Harm Myself!’: Jailbreaking LLMs in Suicide and Self-Harm Contexts
AM Schoene, C Canca · 2025 · 2025 IEEE International Symposium on Technology and Society (ISTAS)
Retrieval-enhanced mental health assessment: Capturing self-state dynamics from social media using in-context learning
A Antony, A Schoene · 2025 · Proc. 10th Workshop on Computational Linguistics and Clinical Psychology
Lexicography saves lives (LSL): Automatically translating suicide-related language
AM Schoene, JE Ortega, RJ Zevallos, LH Ihle · 2025 · Proc. 31st International Conference on Computational Linguistics
The first multilingual model for the detection of suicide texts
RJ Zevallos, AM Schoene, JE Ortega · 2025 · Proc. Second Workshop on Scaling Up Multilingual & Multicultural Evaluation
Automatically extracting social determinants of health for suicide: a narrative literature review
AM Schoene, S Garverich, I Ibrahim, S Shah, B Irving, CC Dacso · 2024 · npj Mental Health Research 3(1), 51
All models are wrong, but some are deadly: Inconsistencies in emotion detection in suicide-related tweets
AM Schoene, R Ramachandranpillai, T Lazovich, RA Baeza-Yates · 2024 · Proc. Third Workshop on NLP for Positive Impact
Is it safe to machine translate suicide-related language from English to Galician?
JE Ortega, AM Schoene · 2024 · Proc. 16th International Conference on Computational Processing of Portuguese
An example of (too much) hyper-parameter tuning in suicide ideation detection
AM Schoene, J Ortega, S Amir, K Church · 2023 · Proc. International AAAI Conference on Web and Social Media 17
Classifying suicide-related content and emotions on Twitter using graph convolutional neural networks
AM Schoene, L Bojanić, MQ Nghiem, IM Hunt, S Ananiadou · 2022 · IEEE Transactions on Affective Computing 14(3), 1791–1802
A narrative literature review of natural language processing applied to the occupational exposome
AM Schoene, I Basinas, M Van Tongeren, S Ananiadou · 2022 · International Journal of Environmental Research and Public Health 19(14), 8544
Natural language processing applied to mental illness detection: a narrative review
T Zhang, AM Schoene, S Ji, S Ananiadou · 2022 · NPJ Digital Medicine 5(1), 46
Exposome methods in occupational epidemiology: Use of text mining for developing Job Exposure Matrices
M van Tongeren, C Ge, E Kuijpers, S Ananiadou et al. · 2021 · Occupational and Environmental Medicine 78(Suppl 1), A162–A163
NERO: a biomedical named-entity recognition ontology with a large, annotated corpus reveals meaningful associations through text embedding
K Wang, R Stevens, H Alachram, Y Li, L Soldatova, R King, S Ananiadou et al. · 2021 · NPJ Systems Biology and Applications 7(1), 38
Automatic identification of suicide notes with a transformer-based deep learning model
T Zhang, AM Schoene, S Ananiadou · 2021 · Internet Interventions 25, 100422
Hierarchical multiscale recurrent neural networks for detecting suicide notes
AM Schoene, AP Turner, G De Mel, N Dethlefs · 2021 · IEEE Transactions on Affective Computing 14(1), 153–164
Dilated LSTM with ranked units for classification of suicide notes
AM Schoene, AP Turner, N Dethlefs · 2019 · AI for Social Good @ NeurIPS
Dilated LSTM with attention for classification of suicide notes
AM Schoene, G Lacey, AP Turner, N Dethlefs · 2019 · Proc. Tenth International Workshop on Health Text Mining and Information Analysis
Automatic identification of suicide notes from linguistic and sentiment features
AM Schoene, N Dethlefs · 2016 · Proc. 10th SIGHUM Workshop on Language Technology for Cultural Heritage