Phylo Tree
Skill by billwanttobetop
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/billwanttobetop/phylo-treePhyloTree | Publication-Grade Phylogenetic Analysis
One-line: Build Nature/Science-level phylogenetic trees from enzyme names or sequences.
🚀 Quick Start (3 steps)
# 1. Activate environment
conda activate r43
# 2. Run analysis
python3 scripts/run_v2.py --query "imine reductase" --output ./output
# 3. Done! Check ./output/figures/ for publication-ready figures
Output: ML tree + 6 figures + QC reports + scientific conclusions
📋 Common Use Cases
Use Case 1: Analyze from FASTA file (Recommended)
python3 scripts/run_v2.py --fasta sequences.fasta --output ./my_analysis
How to get sequences:
- Go to UniProt: https://www.uniprot.org/
- Search for your enzyme (e.g., "imine reductase")
- Click "Download" → "FASTA (canonical)"
- Save as
sequences.fasta
Use Case 2: Analyze by enzyme name (requires UniProt API)
python3 scripts/run_v2.py --query "imine reductase" --output ./ired_analysis
Note: This uses UniProt API which may change. Manual download (Use Case 1) is more reliable.
Use Case 3: Custom parameters
python3 scripts/run_v2.py \
--query "lipase" \
--output ./lipase \
--threads 10 \
--bootstrap 1000 \
--identity 0.90
📊 What You Get
Files generated:
trees/phylo.treefile- ML tree (Newick format)figures/*.png- 6 publication-ready figures (300 DPI)analysis_summary.json- Key statisticsconclusions.md- Scientific findings
Figures:
- Main tree (rectangular layout)
- Circular tree
- Heatmap tree (branch length gradient)
- Branch length distribution
- Genus distribution
- Combined multi-panel
🔧 Key Parameters
| Parameter | Default | Description |
|---|---|---|
--query | - | Enzyme name (UniProt search) |
--fasta | - | Input FASTA file |
--output | - | Output directory |
--threads | 10 | CPU threads |
--bootstrap | 1000 | Bootstrap replicates |
Full parameter list: See references/parameters.md
📖 Need More?
First time setup: references/installation.md
Troubleshooting: references/troubleshooting.md
Interpreting results: references/interpretation.md
Publication checklist: references/publication.md
AI report generation: references/ai_workflow.md
✅ Quality Standards
- ✅ IQ-TREE ML + ModelFinder (1232 models)
- ✅ UFBoot2 + SH-aLRT ≥ 1000
- ✅ Alignment trimming (trimAl)
- ✅ Deduplication (CD-HIT 90%)
- ✅ 300 DPI figures
- ✅ Nature/Science color schemes
Suitable for: Nature, Science, Cell, MBE, Systematic Biology, PNAS
🤖 For AI Agents
After analysis, read:
analysis_summary.json- Structured statisticsconclusions.md- Scientific findingsreferences/report_template.md- Writing template
No need to parse log files!
📚 References
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-billwanttobetop-phylo-tree": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
auto-proteomics
Public OpenClaw skill for low-token routing and downstream analysis of processed DDA LFQ proteomics inputs. Use when the user already has protein-level quantification tables such as MaxQuant-style `proteinGroups.txt` and needs a clear two-group downstream workflow.
Automd Viz
Skill by billwanttobetop
protein-phylogeny
Comprehensive protein family phylogenetic analysis workflow with quality control, conservation analysis, coevolution network analysis, and publication-ready visualization. Use when: (1) analyzing protein family evolution, (2) building phylogenetic trees from sequences, (3) identifying conserved/coevolved residues, (4) generating publication-quality figures and reports, (5) quality-controlling sequence datasets, or (6) performing systematic evolutionary analysis of enzyme families, protein superfamilies, or any homologous protein groups.
automd-gromacs
AutoMD-GROMACS: Automated molecular dynamics simulation workflow - 13 Skills covering system setup, equilibration, production, analysis, free energy, ligand binding, membrane proteins, umbrella sampling, PCA, and workflows. Built-in auto-repair, 84.7% token savings. Part of the AutoMD series.
protein-qc-strict
Strictest protein sequence analysis quality control workflow (3365→456 sequences). Includes literature validation, CD-HIT redundancy removal, complexity check, motif verification, MSA quality assessment, and conservation/coevolution analysis. Based on real research experience with IRED enzyme family.