📚 AIPAMPDS User Guide

Welcome to the AIPAMPDS (AI-based Antimicrobial Peptide Design System) help center. This guide will help you understand how to use all features of the platform.

📋 Table of Contents

1. Generation Module
2. Screening Module
3. Job Status Module
4. Scoring System Explained
5. Model Details & Thresholds
6. Frequently Asked Questions

🧬 1. Generation Module

Overview

The Generation module uses deep learning models to generate novel antimicrobial peptides (AMPs) with specific properties based on your target pathogen selection. Generated peptides are automatically evaluated for their antimicrobial activity, safety profile, and overall therapeutic potential.

How to Use

Navigate to the Generation Page
Click on "Generation" in the navigation menu.
Select Target Pathogen
Choose one of the following options:

Broad-spectrum AMPs: Generates peptides active against both Gram-positive and Gram-negative bacteria
Gram-negative-specific AMPs: Generates peptides targeting Gram-negative bacteria (e.g., Escherichia coli)
Gram-positive-specific AMPs: Generates peptides targeting Gram-positive bacteria (e.g., Staphylococcus aureus)

Set Number of Sequences
Specify how many peptide sequences you want to generate (typically 10-100 sequences recommended).
Submit Job
Click the "Generate" button to start the generation process. You'll receive a unique Job ID.
Track Progress
Use the Job ID to monitor your job in the Job Status page or wait on the current page for results.

Example Output

Here's what the generated results look like:

Sequence	AMP_Score	E.coli_Score	S.aureus_Score	Hemolytic_Score	Score	Evaluate
GLFDIVKKVVGAFGSL	0.9995	0.9985	0.9992	0.1234	4	Good
KWKLFKKIGIGAVLKVL	0.9987	0.9982	0.9988	0.6543	3	Mediocre
FFHHIFRGIVHVGKTIHRLVTG	0.9923	0.9875	0.9801	0.2345	1	Bad

💡 Note about Target-Specific Results:

Broad-spectrum: Shows all 4 scores (AMP, E.coli, S.aureus, Hemolytic) - Maximum score: 4/4
Gram-negative-specific target: Shows AMP, E.coli, and Hemolytic scores only - Maximum score: 3/3
Gram-positive-specific target: Shows AMP, S.aureus, and Hemolytic scores only - Maximum score: 3/3

💡 Note: If a peptide is clearly not an antimicrobial peptide, its sequence will be filtered out (for example, peptides longer than 50 amino acids or shorter than 5 amino acids) and will not appear in the final results.

Download Results

Once the job is complete, you can download the results as a CSV file containing all generated sequences and their evaluation scores. The file name will include your Job ID for easy reference.

💡 Job Processing Time:

Typically 15 seconds for 100 sequences
Processing time varies based on server load and number of sequences

🔬 2. Screening Module

Overview

The Screening module evaluates your own peptide sequences using our trained models. Upload a list of peptide sequences, and the system will predict their antimicrobial properties, antibacterial activity score, and hemolytic score.

How to Use

Navigate to the Screening Page
Click on "Screening" in the navigation menu.
Prepare Your Sequences
Format your peptide sequences as follows:
>peptide_1 GLFDIVKKVVGAFGSL >peptide_2 KWKLFKKIGIGAVLKVL >peptide_3 FFHHIFRGIVHVGKTIHRLVTG

FASTA format with header lines starting with ">"
One sequence per peptide
Amino acid sequences using standard single-letter codes
Maximum file size: 5MB

Upload Sequences
You can either:

Paste sequences directly into the text area, or
Upload a FASTA file (.fasta, .fa, .txt)

Submit Job
Click "Screen Sequences" to start the analysis. You'll receive a unique Job ID.
Review Results
Results are sorted by score (highest to lowest) to help you identify the best candidates quickly.

Example Input

>seq1_test_peptide
GLFDIVKKVVGAFGSL
>seq2_candidate
KWKLFKKIGIGAVLKVL
>seq3_experimental
FFHHIFRGIVHVGKTIHRLVTG
>seq4_novel_amp
GIGKFLHSAKKFGKAFVGEIMNS

Example Output

Here's what the screening results look like:

Sequence	AMP_Score	E.coli_Score	S.aureus_Score	Hemolytic_Score	Score	Evaluate
GLFDIVKKVVGAFGSL	0.9995	0.9985	0.9992	0.1234	4	Good
GIGKFLHSAKKFGKAFVGEIMNS	0.9989	0.9983	0.9990	0.3456	4	Good
KWKLFKKIGIGAVLKVL	0.9987	0.9982	0.9988	0.6543	3	Mediocre
FFHHIFRGIVHVGKTIHRLVTG	0.9923	0.9875	0.9801	0.2345	1	Bad

💡 Note: If a peptide is clearly not an antimicrobial peptide, its sequence will be filtered out (for example, peptides longer than 50 amino acids or shorter than 5 amino acids) and will not appear in the final results.

Download Results

Download results as CSV for further analysis in Excel, R, Python, or other tools. Results include all original sequence names for easy tracking.

💡 Job Processing Time:

Typically 9 seconds for 100 sequences
Processing time varies based on server load and number of sequences

📊 3. Job Status Module

Overview

The Job Status module allows you to track the progress of your submitted jobs, view completed results, and download output files.

How to Use

Navigate to Job Status Page
Click on "Job Status" in the navigation menu.
Enter Your Job ID
Paste the Job ID you received when submitting your generation or screening job.
Example Job ID: 9a972711-7516-4d61-a30d-c45731daa5b1
Check Status
Click "Check Status" to view your job information. You'll see:

Status: PENDING, PROCESSING, COMPLETED, or FAILED
Progress: Percentage complete (if available)
Type: Generation or Screening
Created: Submission timestamp

Download Results
Click the "Download Results" button to save the complete CSV file to your computer.

⚠️ Important: Save your Job ID! Results are stored for 30 days, after which they may be automatically deleted to save server space.

🎯 4. Scoring System Explained

Overview

Our scoring system evaluates peptides based on multiple criteria to predict their potential as therapeutic antimicrobial agents. Each criterion contributes to an overall score.

Scoring Criteria

1. AMP Score (Antimicrobial Peptide Identification)

What it measures: The probability that a peptide is an antimicrobial peptide.
Threshold: 0.998
Scoring: Score > 0.998 → +1 point
Model: PeptideBERT-LSTM trained on curated AMP databases

2. E.coli Activity Score (Gram-negative Activity)

What it measures: Probability of antimicrobial activity against E.coli (Gram-negative bacteria).
Threshold: 0.9980798959732056
Scoring: Score > threshold → +1 point (active against E.coli)
Note: Only evaluated for broad-spectrum and E.coli-specific generations

3. S.aureus Activity Score (Gram-positive Activity)

What it measures: Probability of antimicrobial activity against S.aureus (Gram-positive bacteria).
Threshold: 0.9989857077598572
Scoring: Score > threshold → +1 point (active against S.aureus)
Note: Only evaluated for broad-spectrum and S.aureus-specific generations

4. Hemolytic Score (Safety Assessment)

What it measures: Probability of hemolytic activity (damage to red blood cells).
Threshold: 0.5
Scoring: Score ≤ 0.5 → +1 point (non-hemolytic, safe)
Importance: Critical for drug safety - hemolytic peptides are toxic

Overall Evaluation

For Screening & Broad-spectrum Generation (4-point scale):

Good Score = 4/4

Meets all criteria: AMP + E.coli + S.aureus + Non-hemolytic
Broad-spectrum antimicrobial with low toxicity
Highly recommended for further development

Mediocre Score = 2-3/4

Partially meets criteria
May have limited spectrum or safety concerns
Acceptable but requires evaluation

Bad Score = 0-1/4

Fails to meet most criteria
Not antimicrobial, inactive, or toxic
Not recommended

For Target-Specific Generation (3-point scale):

Good Score = 3/3

E.coli target: AMP + E.coli activity + Non-hemolytic
S.aureus target: AMP + S.aureus activity + Non-hemolytic
Optimal for targeted therapy

Mediocre Score = 2/3

Meets 2 out of 3 criteria
May lack activity or have safety concerns

Bad Score = 0-1/3

Fails most criteria
Not suitable for targeted therapy

🤖 5. Model Details & Thresholds

Model Architecture

AIPAMPDS uses state-of-the-art deep learning models based on transformer architectures and recurrent neural networks, trained on large-scale antimicrobial peptide databases.

Threshold Selection

Model thresholds are optimized to balance sensitivity and specificity:

High thresholds (e.g., 0.998) ensure high confidence predictions
Validated on experimental data from peer-reviewed publications
Minimize false positives to reduce wasted experimental effort

Training Data

AMP Identifier:

Trained on DBAASP, APD3, CAMP databases
~10,000+ validated AMP sequences
Negative samples from UniProt

Activity Classifiers:

Trained on experimental antimicrobial activity data
E.coli and S.aureus specific models
Binary classification: active vs inactive against specific bacteria

Hemolysis Predictor:

Trained on hemolytic activity assays
Critical for safety assessment

📊 Model Performance Metrics:
Our models achieve high accuracy on held-out test sets, with AUROC values > 0.95 for all classification tasks. However, computational predictions should always be validated experimentally before clinical use.

❓ 6. Frequently Asked Questions

Q: How long are my results stored?

Results are stored on our servers for 30 days. After this period, they may be automatically deleted to free up space. We recommend downloading your results promptly after job completion.

Q: Can I use the generated sequences commercially?

Yes, you can use the generated sequences for research and commercial purposes. However, we recommend:

Experimental validation before any application
Patent searches to ensure novelty
Consulting with legal experts for IP protection

Q: What if my job fails?

Job failures can occur due to:

Invalid input format (check FASTA formatting)
Server issues (temporary - try again later)
Sequences too long (>100 amino acids may cause issues)

If problems persist, contact us via the Contact page with your Job ID.

Q: How accurate are the predictions?

Our models achieve >95% accuracy on validation datasets. However:

Predictions are computational estimates, not guarantees
Experimental validation is essential
Novel sequences may behave differently than training data
Always test in lab conditions before applications

Q: Can I generate very long peptides?

Our generation models are optimized for peptides of 10-40 amino acids. While longer sequences can be screened, generation quality may decrease for very long peptides (>50 amino acids).

Q: What does "Broad-spectrum" mean?

Broad-spectrum antimicrobials are active against both Gram-positive and Gram-negative bacteria. This is valuable for treating infections with unknown or mixed bacterial populations.

Q: Why are some scores very close to the threshold?

Scores near the threshold (e.g., 0.9979 vs threshold 0.998) indicate borderline cases. These peptides may have marginal activity and should be interpreted with caution. We recommend focusing on peptides with scores significantly above thresholds.

Q: Can I request new features or models?

Yes! We welcome feedback and feature requests. Please use the Contact page to suggest new pathogens, additional properties to predict, or other enhancements.

📧 Still have questions? Contact us through the Contact page and we'll be happy to help!

🚀 Quick Reference Card

Generation

✓ Select target
✓ Set number of sequences
✓ Submit & track with Job ID

Screening

✓ Prepare FASTA format
✓ Upload or paste sequences
✓ Results sorted by score

Job Status

✓ Enter Job ID
✓ Check progress
✓ Download results (CSV)