ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

Phylo Tree

Skill by billwanttobetop

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/billwanttobetop/phylo-tree
Or

PhyloTree | Publication-Grade Phylogenetic Analysis

One-line: Build Nature/Science-level phylogenetic trees from enzyme names or sequences.


🚀 Quick Start (3 steps)

# 1. Activate environment
conda activate r43

# 2. Run analysis
python3 scripts/run_v2.py --query "imine reductase" --output ./output

# 3. Done! Check ./output/figures/ for publication-ready figures

Output: ML tree + 6 figures + QC reports + scientific conclusions


📋 Common Use Cases

Use Case 1: Analyze from FASTA file (Recommended)

python3 scripts/run_v2.py --fasta sequences.fasta --output ./my_analysis

How to get sequences:

  1. Go to UniProt: https://www.uniprot.org/
  2. Search for your enzyme (e.g., "imine reductase")
  3. Click "Download" → "FASTA (canonical)"
  4. Save as sequences.fasta

Use Case 2: Analyze by enzyme name (requires UniProt API)

python3 scripts/run_v2.py --query "imine reductase" --output ./ired_analysis

Note: This uses UniProt API which may change. Manual download (Use Case 1) is more reliable.

Use Case 3: Custom parameters

python3 scripts/run_v2.py \
  --query "lipase" \
  --output ./lipase \
  --threads 10 \
  --bootstrap 1000 \
  --identity 0.90

📊 What You Get

Files generated:

  • trees/phylo.treefile - ML tree (Newick format)
  • figures/*.png - 6 publication-ready figures (300 DPI)
  • analysis_summary.json - Key statistics
  • conclusions.md - Scientific findings

Figures:

  1. Main tree (rectangular layout)
  2. Circular tree
  3. Heatmap tree (branch length gradient)
  4. Branch length distribution
  5. Genus distribution
  6. Combined multi-panel

🔧 Key Parameters

ParameterDefaultDescription
--query-Enzyme name (UniProt search)
--fasta-Input FASTA file
--output-Output directory
--threads10CPU threads
--bootstrap1000Bootstrap replicates

Full parameter list: See references/parameters.md


📖 Need More?

First time setup: references/installation.md
Troubleshooting: references/troubleshooting.md
Interpreting results: references/interpretation.md
Publication checklist: references/publication.md
AI report generation: references/ai_workflow.md


✅ Quality Standards

  • ✅ IQ-TREE ML + ModelFinder (1232 models)
  • ✅ UFBoot2 + SH-aLRT ≥ 1000
  • ✅ Alignment trimming (trimAl)
  • ✅ Deduplication (CD-HIT 90%)
  • ✅ 300 DPI figures
  • ✅ Nature/Science color schemes

Suitable for: Nature, Science, Cell, MBE, Systematic Biology, PNAS


🤖 For AI Agents

After analysis, read:

  1. analysis_summary.json - Structured statistics
  2. conclusions.md - Scientific findings
  3. references/report_template.md - Writing template

No need to parse log files!


📚 References

Metadata

Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-billwanttobetop-phylo-tree": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

auto-proteomics

Public OpenClaw skill for low-token routing and downstream analysis of processed DDA LFQ proteomics inputs. Use when the user already has protein-level quantification tables such as MaxQuant-style `proteinGroups.txt` and needs a clear two-group downstream workflow.

billwanttobetop 4473

Automd Viz

Skill by billwanttobetop

billwanttobetop 4473

protein-phylogeny

Comprehensive protein family phylogenetic analysis workflow with quality control, conservation analysis, coevolution network analysis, and publication-ready visualization. Use when: (1) analyzing protein family evolution, (2) building phylogenetic trees from sequences, (3) identifying conserved/coevolved residues, (4) generating publication-quality figures and reports, (5) quality-controlling sequence datasets, or (6) performing systematic evolutionary analysis of enzyme families, protein superfamilies, or any homologous protein groups.

billwanttobetop 4473

automd-gromacs

AutoMD-GROMACS: Automated molecular dynamics simulation workflow - 13 Skills covering system setup, equilibration, production, analysis, free energy, ligand binding, membrane proteins, umbrella sampling, PCA, and workflows. Built-in auto-repair, 84.7% token savings. Part of the AutoMD series.

billwanttobetop 4473

protein-qc-strict

Strictest protein sequence analysis quality control workflow (3365→456 sequences). Includes literature validation, CD-HIT redundancy removal, complexity check, motif verification, MSA quality assessment, and conservation/coevolution analysis. Based on real research experience with IRED enzyme family.

billwanttobetop 4473