Search compounds by name. Data from PubChem, vendors and regulatory agencies.
| Name | SMILES | Category | Triangle | Tox-score | SAscore |
|---|
This is a searchable database of chemical compounds pulled from multiple sources — regulatory lists, Wikipedia, PsychonautWiki, TripSit, and various government agencies (IARC, EPA, EU REACH, NTP, and others). Each chemical has been looked up on PubChem to get its SMILES notation, then run through EtoxPred to estimate toxicity.
Tox-score (0 to 1): This comes from EtoxPred, a machine learning model trained on FDA-approved drugs (labeled safe) and TOXNET/T3DB chemicals (labeled toxic). It uses Morgan fingerprints — basically a mathematical representation of molecular structure — fed into an Extra Trees classifier. A score near 0 means the molecule looks structurally similar to known safe drugs. A score near 1 means it resembles known toxic compounds. The model gets about 80% accuracy on independent test sets, so treat it as a rough guide, not gospel.
Things to keep in mind:
SAscore (Synthetic Accessibility, ~0 to 1 after transformation): How easy or hard the molecule would be to synthesize in a lab. Lower values mean easier to make. This is based on fragment analysis — common molecular fragments that show up a lot in known synthetic compounds get better scores. Unusual ring systems, lots of stereocenters, and weird functional groups push the score up.
Why this matters:
Chemicals can appear in multiple categories:
A chemical appearing in the carcinogen list doesn't necessarily mean it causes cancer in humans at normal exposure levels — IARC classifications are hazard-based, not risk-based. Similarly, something listed as an endocrine disruptor means it has the potential to interfere with hormonal systems, but dose and exposure route matter enormously.
Each chemical gets a small triangle (CEP) chart showing three independent scores at a glance — Carcinogenic (top, red), Endocrine disruptor (bottom-left, gold), and Psychoactive (bottom-right, blue). Each score ranges from 0 to 1.
The outer triangle is the reference boundary (all scores at 1.0). The inner shaded polygon shows the compound's actual scores. Each vertex is pulled from the center toward its corner proportional to the score:
The exact values are shown below the graph as C:0.xx E:0.xx P:0.xx. These scores come from sub-models trained on category-specific datasets, so they are independent of the overall Tox-score. A compound can have a low Tox-score but still show a high score on one of these axes, or vice versa.