CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction

Abstract

Drug toxicity remains a major challenge in pharmaceutical development. Recent machine learning models have improved in silico toxicity prediction, but their reliance on annotated data and lack of interpretability limit their applicability. This limits their ability to capture organ-specific toxicities driven by complex biological mechanisms. Large language models (LLMs) offer a promising alternative through step-by-step reasoning and integration of textual data, yet prior approaches lack biological context and transparent rationale. To address this issue, we propose CoTox, a novel framework that integrates LLM with chain-of-thought (CoT) reasoning for multi-toxicity prediction. CoTox combines chemical structure data, biological pathways, and Gene Ontology (GO) terms to generate interpretable toxicity predictions through step-by-step reasoning. Using GPT-4o, we show that CoTox outperforms both traditional machine learning and deep learning model. We further examine its performance across various architectures to identify where CoTox is most effective. Additionally, we find that representing chemical structures with IUPAC names, which are easier for LLMs to understand than SMILES, enhances the model’s reasoning ability and improves predictive performance. To demonstrate its practical utility in drug development, we simulated the treatment of relevant cell types with drug and incorporated the resulting biological context into the CoTox framework. This approach allowed CoTox to generate toxicity predictions aligned with physiological responses, as shown in case studies. These results highlight the potential of LLM-based frameworks to improve interpretability and support early-stage drug safety assessment.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
CTD		CTD
_figure		_figure
l1000_case_study		l1000_case_study
.gitignore		.gitignore
88_propranolol.json		88_propranolol.json
CoTox_iupac_gpt_4o.py		CoTox_iupac_gpt_4o.py
README.md		README.md
Unitox_CTD_Drug_test_1.json		Unitox_CTD_Drug_test_1.json
metric.py		metric.py
run_cotox.sh		run_cotox.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction

Abstract

About

Uh oh!

Releases

Packages

Uh oh!

Languages

dmis-lab/CoTox

Folders and files

Latest commit

History

Repository files navigation

CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction

Abstract

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages