Data & ReproDrug Discovery & CheminformaticsK-Dense-AI/claude-scientific-skillsData & Reproduction
DE

DeepChem

Maintainer K-Dense Inc. · Last updated April 1, 2026

DeepChem is a comprehensive Python library for applying machine learning to chemistry, materials science, and biology. Enable molecular property prediction, drug discovery, materials design, and biomolecule analysis through speciali.

Claude CodeOpenClawNanoClawAnalysisReproductiondeepchemchemistrypackagecheminformatics & drug discovery

Original source

K-Dense-AI/claude-scientific-skills

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/deepchem

Maintainer
K-Dense Inc.
License
MIT license
Last updated
April 1, 2026

Skill Snapshot

Key Details From SKILL.md

2 min

Key Notes

  • DeepChem is a comprehensive Python library for applying machine learning to chemistry, materials science, and biology. Enable molecular property prediction, drug discovery, materials design, and biomolecule analysis through specialized neural networks, molecular featurization methods, and pretrained models.
  • featurizer = dc.feat.CircularFingerprint(radius=2, size=2048) loader = dc.data.CSVLoader( tasks=['solubility', 'toxicity'], feature_field='smiles', featurizer=featurizer ) dataset = loader.create_dataset('molecules.csv').

Source Doc

Excerpt From SKILL.md

When to Use This Skill

This skill should be used when:

  • Loading and processing molecular data (SMILES strings, SDF files, protein sequences)
  • Predicting molecular properties (solubility, toxicity, binding affinity, ADMET properties)
  • Training models on chemical/biological datasets
  • Using MoleculeNet benchmark datasets (Tox21, BBBP, Delaney, etc.)
  • Converting molecules to ML-ready features (fingerprints, graph representations, descriptors)
  • Implementing graph neural networks for molecules (GCN, GAT, MPNN, AttentiveFP)
  • Applying transfer learning with pretrained models (ChemBERTa, GROVER, MolFormer)
  • Predicting crystal/materials properties (bandgap, formation energy)
  • Analyzing protein or DNA sequences

1. Molecular Data Loading and Processing

DeepChem provides specialized loaders for various chemical data formats:

import deepchem as dc

## Load SDF files

loader = dc.data.SDFLoader(tasks=['activity'], featurizer=featurizer)
dataset = loader.create_dataset('compounds.sdf')

Use cases

  • Loading and processing molecular data (SMILES strings, SDF files, protein sequences).
  • Predicting molecular properties (solubility, toxicity, binding affinity, ADMET properties).
  • Training models on chemical/biological datasets.
  • Using MoleculeNet benchmark datasets (Tox21, BBBP, Delaney, etc.).

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.

Related skills

Related skills

Back to directory
BI
Data & ReproDrug Discovery & Cheminformatics

bio-clinical-databases-pharmacogenomics

Query PharmGKB and CPIC for drug-gene interactions, pharmacogenomic annotations, and dosing guidelines. Use when predicting drug response fr…

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
CH
Data & ReproDrug Discovery & Cheminformatics

chematagent-drug-discovery

CheMatAgent: chemistry-aware drug design with retrosynthesis and property optimization.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
CH
Data & ReproDrug Discovery & Cheminformatics

ChEMBL

ChEMBL is a manually curated database of bioactive molecules maintained by the European Bioinformatics Institute (EBI), containing over 2 mi…

Claude CodeOpenClawAnalysis
K-Dense-AI/claude-scientific-skillsView
CH
Data & ReproDrug Discovery & Cheminformatics

chemcrow-drug-discovery

ChemCrow drug discovery toolkit: web search, Python, chemical tools integration.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView