General
Best Practices
Event Procedure
Helpdesk Request Form
Lab Closure Procedure
Links
Resource Drive Conventions
Resource Map
Shipping (FedEx)
Shipping (International)
Suggestion Box
Support Article Guidelines
Welcome
Information Technology
Email & Calendar
Add Calendars to iPhone
Confidential emails
Create a Shared Calendar
Create an Email Group
Email Filters & Rules
Email Groups
Email Headers
Email Signatures
Google 2-Step Verification
Google Calendar Overview
Phishing
Schedule emails
Staff Minus One Emails
Using Outlook with Gmail
Using Yubico Security Keys with your Google account
Print & Fax
Software
1Password
Adobe Acrobat DC
Adobe Creative Cloud
ArcGIS
Colby VPN
Combine PDFs in Adobe Acrobat
Excel Trust Settings
Install Falcon Antivirus
Microsoft Office
Microsoft Office Runtime Error Fix
Mosyle Mac Enrollment
Raiser's Edge
Slack
Software Resources
Uninstall OpenVPN
Windows 10 - Restore from backup
Updates
2020 December - email name spoofing
2020 October - COVID resources
2021 December - WiFi
2021 January - Zoom recording + private chat
2021 March - NetSuite Google authentication
2021 March - VPN Upgrade
2023 - Zoom Updates
2024 May - VPN SSO
DNS
DNS Change
Data Storage and Computer Backups
Google Drive
HPCC and Storage Proposal Information
Laptop Recommendations
Loaner Hardware
Migrating data from Storage to Google Drive
Passwords
Phones
Restoring Files
Storage
VPN
VPN Migration
Vendor Access
Website Request
WiFi
Zeiss Digital Classroom
HR & Payroll
Paid Time Off
Payroll Overview & FAQ
Personnel Offboarding
Personnel Onboarding
Timesheet Approval (supervisors)
Timesheets
Facilities
BMS Access
Bigelow R/V Billing Form
E&I Wing Construction Update
R/V Bowditch Reservation Center
R/V Clarice Reservation Center
Finance
Admin
Budget & Reports
Invoicing
Policies & Procedures
Advancement Entry of Donations and Pledges
Corporate Traveler / Melon
Gas and Cryo-Supply Ordering Process and Form Link
Purchasing Flowchart - for staff reference
Purchasing Policy
Vendors Exempt from Purchase Orders
Proposals
Purchase & Expense
Bill/Invoice Approval
Creating a Bill to be Paid
Equipment Capitalization Help
Expense Report
Expense Report (example)
Non-Employee Reimbursement
Purchase Order
Purchase Order (example)
Purchase Order (supplemental)
Recurring Purchase Order (SRS)
Amazon.com
Approval Reminders
Business Office Orientation
Capital One - Corporate Credit Card
Customize Dashboard
Dashboard (SRS)
NetSuite FAQ
NetSuite Login
NetSuite shortcuts
Revenue Flow Chart
Workshop, Training Projects, and Participant Support Help
Computing
Software
AAI Calculation
ANI Calculation
AlphaFold
Anvi'o
Conda environments
Jupyter notebook
Prokka
RStudio
dada2
sag-mg-recruit
Job management
Charlie Overview
Connect to Charlie
Edit with VS Code
Getting Started
Monitor jobs
Software modules
Transfer files
Zoom
- Home
- Computing
- Software
- ANI Calculation
ANI Calculation
Updated
The quickest way to calculate ANI is with FastANI. It is available as a module. To load:
module use /mod/scgc
module load fastani
FastANI instructions:
$ fastANI --help
-----------------
fastANI is a fast alignment-free implementation for computing whole-genome
Average Nucleotide Identity (ANI) between genomes
-----------------
Example usage:
$ fastANI -q genome1.fa -r genome2.fa -o output.txt
$ fastANI -q genome1.fa --rl genome_list.txt -o output.txt
Available options
-----------------
-h, --help
Print this help page
-r <value>, --ref <value>
reference genome (fasta/fastq)[.gz]
--refList <value>, --rl <value>
a file containing list of reference genome files, one genome per line
-q <value>, --query <value>
query genome (fasta/fastq)[.gz]
--ql <value>, --queryList <value>
a file containing list of query genome files, one genome per line
-k <value>, --kmer <value>
kmer size <= 16 [default : 16]
-t <value>, --threads <value>
thread count for parallel execution [default : 1]
--fragLen <value>
fragment length [default : 3,000]
--minFrag <value>
minimum matched fragments for trusting ANI [default : 50]
--visualize
output mappings for visualization, can be enabled for single genome to
single genome comparison only [disabled by default]
--matrix
also output ANI values as lower triangular matrix (format inspired from
phylip). If enabled, you should expect an output file with .matrix
extension [disabled by default]
-o <value>, --output <value> [required]
output file name
Another way to calculate ANI is by using pyani, which is a BLAST-based method. pyani is installed within SCGC's anaconda3 module, and can be run on c1 and c2.
To run pyani:
- Place all fasta files that you'd like to compare into a directory
- Load SCGC's anaconda3 module
module use /mod/scgc
module load anaconda3
module load blast
Run pyani:
average_nucleotide_identity.py -o {outfile} -i {contig_directory} -m {choose_method, we use ANIb} --workers {threads}
Example PBS script:
#!/bin/bash
## set name of PBS job
#PBS -N pyani
## set the queue
#PBS -q scgc-route
#PBS -l walltime=96:00:00
#PBS -l mem=10G
#PBS -l ncpus=30
#PBS -j oe
#PBS -o /home/julia/out/180724_pyani.out
module unload anaconda
module load anaconda3
indir=/mnt/scgc/simon/simonsproject/contigs/
outdir = /mnt/scgc/simon/simonsproject/ani_out/
mkdir $outdir
average_nucleotide_identity.py -o ${outdir} \
-i ${indir} -m ANIb --workers 30