Official Verified

Phylo Tree

Skill by billwanttobetop

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/billwanttobetop/phylo-tree

Download Source Code (.zip)

PhyloTree | Publication-Grade Phylogenetic Analysis

One-line: Build Nature/Science-level phylogenetic trees from enzyme names or sequences.

🚀 Quick Start (3 steps)

# 1. Activate environment
conda activate r43

# 2. Run analysis
python3 scripts/run_v2.py --query "imine reductase" --output ./output

# 3. Done! Check ./output/figures/ for publication-ready figures

Output: ML tree + 6 figures + QC reports + scientific conclusions

📋 Common Use Cases

Use Case 1: Analyze from FASTA file (Recommended)

python3 scripts/run_v2.py --fasta sequences.fasta --output ./my_analysis

How to get sequences:

Go to UniProt: https://www.uniprot.org/
Search for your enzyme (e.g., "imine reductase")
Click "Download" → "FASTA (canonical)"
Save as sequences.fasta

Use Case 2: Analyze by enzyme name (requires UniProt API)

python3 scripts/run_v2.py --query "imine reductase" --output ./ired_analysis

Note: This uses UniProt API which may change. Manual download (Use Case 1) is more reliable.

Use Case 3: Custom parameters

python3 scripts/run_v2.py \
  --query "lipase" \
  --output ./lipase \
  --threads 10 \
  --bootstrap 1000 \
  --identity 0.90

📊 What You Get

Files generated:

trees/phylo.treefile - ML tree (Newick format)
figures/*.png - 6 publication-ready figures (300 DPI)
analysis_summary.json - Key statistics
conclusions.md - Scientific findings

Figures:

Main tree (rectangular layout)
Circular tree
Heatmap tree (branch length gradient)
Branch length distribution
Genus distribution
Combined multi-panel

🔧 Key Parameters

Parameter	Default	Description
`--query`	-	Enzyme name (UniProt search)
`--fasta`	-	Input FASTA file
`--output`	-	Output directory
`--threads`	10	CPU threads
`--bootstrap`	1000	Bootstrap replicates

Full parameter list: See references/parameters.md

📖 Need More?

First time setup: references/installation.md
Troubleshooting: references/troubleshooting.md
Interpreting results: references/interpretation.md
Publication checklist: references/publication.md
AI report generation: references/ai_workflow.md

✅ Quality Standards

✅ IQ-TREE ML + ModelFinder (1232 models)
✅ UFBoot2 + SH-aLRT ≥ 1000
✅ Alignment trimming (trimAl)
✅ Deduplication (CD-HIT 90%)
✅ 300 DPI figures
✅ Nature/Science color schemes

Suitable for: Nature, Science, Cell, MBE, Systematic Biology, PNAS

🤖 For AI Agents

After analysis, read:

analysis_summary.json - Structured statistics
conclusions.md - Scientific findings
references/report_template.md - Writing template

No need to parse log files!

📚 References

Read Full Documentation on GitHub

Metadata

Author@billwanttobetop

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-billwanttobetop-phylo-tree": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

auto-proteomics

Public OpenClaw skill for low-token routing and downstream analysis of processed DDA LFQ proteomics inputs. Use when the user already has protein-level quantification tables such as MaxQuant-style `proteinGroups.txt` and needs a clear two-group downstream workflow.

billwanttobetop 4473

Automd Viz

Skill by billwanttobetop

billwanttobetop 4473

protein-phylogeny

Comprehensive protein family phylogenetic analysis workflow with quality control, conservation analysis, coevolution network analysis, and publication-ready visualization. Use when: (1) analyzing protein family evolution, (2) building phylogenetic trees from sequences, (3) identifying conserved/coevolved residues, (4) generating publication-quality figures and reports, (5) quality-controlling sequence datasets, or (6) performing systematic evolutionary analysis of enzyme families, protein superfamilies, or any homologous protein groups.

billwanttobetop 4473

automd-gromacs

AutoMD-GROMACS: Automated molecular dynamics simulation workflow - 13 Skills covering system setup, equilibration, production, analysis, free energy, ligand binding, membrane proteins, umbrella sampling, PCA, and workflows. Built-in auto-repair, 84.7% token savings. Part of the AutoMD series.

billwanttobetop 4473

protein-qc-strict

Strictest protein sequence analysis quality control workflow (3365→456 sequences). Includes literature validation, CD-HIT redundancy removal, complexity check, motif verification, MSA quality assessment, and conservation/coevolution analysis. Based on real research experience with IRED enzyme family.

billwanttobetop 4473