📚 AIPAMPDS User Guide
Welcome to the AIPAMPDS (AI-based Antimicrobial Peptide Design System) help center.
This guide will help you understand how to use all features of the platform.
🧬 1. Generation Module
Overview
The Generation module uses deep learning models to generate novel antimicrobial peptides (AMPs)
with specific properties based on your target pathogen selection. Generated peptides are automatically
evaluated for their antimicrobial activity, safety profile, and overall therapeutic potential.
How to Use
-
Navigate to the Generation Page
Click on "Generation" in the navigation menu.
-
Select Target Pathogen
Choose one of the following options:
- Broad-spectrum AMPs: Generates peptides active against both Gram-positive and Gram-negative
bacteria
- Gram-negative-specific AMPs: Generates peptides targeting Gram-negative bacteria (e.g.,
Escherichia coli)
- Gram-positive-specific AMPs: Generates peptides targeting Gram-positive bacteria (e.g.,
Staphylococcus aureus)
-
Set Number of Sequences
Specify how many peptide sequences you want to generate (typically 10-100 sequences recommended).
-
Submit Job
Click the "Generate" button to start the generation process. You'll receive a unique Job ID.
-
Track Progress
Use the Job ID to monitor your job in the Job Status page or wait on the current page for results.
Example Output
Here's what the generated results look like:
| Sequence |
AMP_Score |
E.coli_Score |
S.aureus_Score |
Hemolytic_Score |
Score |
Evaluate |
| GLFDIVKKVVGAFGSL |
0.9995 |
0.9985 |
0.9992 |
0.1234 |
4 |
Good |
| KWKLFKKIGIGAVLKVL |
0.9987 |
0.9982 |
0.9988 |
0.6543 |
3 |
Mediocre |
| FFHHIFRGIVHVGKTIHRLVTG |
0.9923 |
0.9875 |
0.9801 |
0.2345 |
1 |
Bad |
💡 Note about Target-Specific Results:
- Broad-spectrum: Shows all 4 scores (AMP, E.coli, S.aureus, Hemolytic)
- Maximum score:
4/4
- Gram-negative-specific target: Shows AMP, E.coli, and Hemolytic scores only - Maximum
score: 3/3
- Gram-positive-specific target: Shows AMP, S.aureus, and Hemolytic scores only -
Maximum score: 3/3
💡 Note: If a peptide is clearly not an antimicrobial peptide, its sequence will be
filtered out (for example, peptides longer than 50 amino acids or shorter than 5 amino acids) and will not
appear in the final results.
Download Results
Once the job is complete, you can download the results as a CSV file containing all generated sequences
and their evaluation scores. The file name will include your Job ID for easy reference.
💡 Job Processing Time:
- Typically 15 seconds for 100 sequences
- Processing time varies based on server load and number of sequences
🔬 2. Screening Module
Overview
The Screening module evaluates your own peptide sequences using our trained models.
Upload a list of peptide sequences, and the system will predict their antimicrobial properties,
antibacterial activity score, and hemolytic score.
How to Use
-
Navigate to the Screening Page
Click on "Screening" in the navigation menu.
-
Prepare Your Sequences
Format your peptide sequences as follows:
>peptide_1
GLFDIVKKVVGAFGSL
>peptide_2
KWKLFKKIGIGAVLKVL
>peptide_3
FFHHIFRGIVHVGKTIHRLVTG
- FASTA format with header lines starting with ">"
- One sequence per peptide
- Amino acid sequences using standard single-letter codes
- Maximum file size: 5MB
-
Upload Sequences
You can either:
- Paste sequences directly into the text area, or
- Upload a FASTA file (.fasta, .fa, .txt)
-
Submit Job
Click "Screen Sequences" to start the analysis. You'll receive a unique Job ID.
-
Review Results
Results are sorted by score (highest to lowest) to help you identify the best candidates quickly.
Example Input
>seq1_test_peptide
GLFDIVKKVVGAFGSL
>seq2_candidate
KWKLFKKIGIGAVLKVL
>seq3_experimental
FFHHIFRGIVHVGKTIHRLVTG
>seq4_novel_amp
GIGKFLHSAKKFGKAFVGEIMNS
Example Output
Here's what the screening results look like:
| Sequence |
AMP_Score |
E.coli_Score |
S.aureus_Score |
Hemolytic_Score |
Score |
Evaluate |
| GLFDIVKKVVGAFGSL |
0.9995 |
0.9985 |
0.9992 |
0.1234 |
4 |
Good |
| GIGKFLHSAKKFGKAFVGEIMNS |
0.9989 |
0.9983 |
0.9990 |
0.3456 |
4 |
Good |
| KWKLFKKIGIGAVLKVL |
0.9987 |
0.9982 |
0.9988 |
0.6543 |
3 |
Mediocre |
| FFHHIFRGIVHVGKTIHRLVTG |
0.9923 |
0.9875 |
0.9801 |
0.2345 |
1 |
Bad |
💡 Note: If a peptide is clearly not an antimicrobial peptide, its sequence will be
filtered out (for example, peptides longer than 50 amino acids or shorter than 5 amino acids) and will not
appear in the final results.
Download Results
Download results as CSV for further analysis in Excel, R, Python, or other tools.
Results include all original sequence names for easy tracking.
💡 Job Processing Time:
- Typically 9 seconds for 100 sequences
- Processing time varies based on server load and number of sequences
📊 3. Job Status Module
Overview
The Job Status module allows you to track the progress of your submitted jobs,
view completed results, and download output files.
How to Use
-
Navigate to Job Status Page
Click on "Job Status" in the navigation menu.
-
Enter Your Job ID
Paste the Job ID you received when submitting your generation or screening job.
Example Job ID: 9a972711-7516-4d61-a30d-c45731daa5b1
-
Check Status
Click "Check Status" to view your job information. You'll see:
- Status: PENDING, PROCESSING, COMPLETED, or FAILED
- Progress: Percentage complete (if available)
- Type: Generation or Screening
- Created: Submission timestamp
-
Download Results
Click the "Download Results" button to save the complete CSV file to your computer.
⚠️ Important: Save your Job ID! Results are stored for 30 days,
after which they may be automatically deleted to save server space.
🎯 4. Scoring System Explained
Overview
Our scoring system evaluates peptides based on multiple criteria to predict their potential
as therapeutic antimicrobial agents. Each criterion contributes to an overall score.
Scoring Criteria
1. AMP Score (Antimicrobial Peptide Identification)
What it measures: The probability that a peptide is an antimicrobial peptide.
Threshold: 0.998
Scoring: Score > 0.998 → +1 point
Model: PeptideBERT-LSTM trained on curated AMP databases
2. E.coli Activity Score (Gram-negative Activity)
What it measures: Probability of antimicrobial activity against E.coli (Gram-negative
bacteria).
Threshold: 0.9980798959732056
Scoring: Score > threshold → +1 point (active against E.coli)
Note: Only evaluated for broad-spectrum and E.coli-specific generations
3. S.aureus Activity Score (Gram-positive Activity)
What it measures: Probability of antimicrobial activity against S.aureus (Gram-positive
bacteria).
Threshold: 0.9989857077598572
Scoring: Score > threshold → +1 point (active against S.aureus)
Note: Only evaluated for broad-spectrum and S.aureus-specific generations
4. Hemolytic Score (Safety Assessment)
What it measures: Probability of hemolytic activity (damage to red blood cells).
Threshold: 0.5
Scoring: Score ≤ 0.5 → +1 point (non-hemolytic, safe)
Importance: Critical for drug safety - hemolytic peptides are toxic
Overall Evaluation
For Screening & Broad-spectrum Generation (4-point scale):
Good Score = 4/4
- Meets all criteria: AMP + E.coli + S.aureus + Non-hemolytic
- Broad-spectrum antimicrobial with low toxicity
- Highly recommended for further development
Mediocre Score = 2-3/4
- Partially meets criteria
- May have limited spectrum or safety concerns
- Acceptable but requires evaluation
Bad Score = 0-1/4
- Fails to meet most criteria
- Not antimicrobial, inactive, or toxic
- Not recommended
For Target-Specific Generation (3-point scale):
Good Score = 3/3
- E.coli target: AMP + E.coli activity + Non-hemolytic
- S.aureus target: AMP + S.aureus activity + Non-hemolytic
- Optimal for targeted therapy
Mediocre Score = 2/3
- Meets 2 out of 3 criteria
- May lack activity or have safety concerns
Bad Score = 0-1/3
- Fails most criteria
- Not suitable for targeted therapy
🤖 5. Model Details & Thresholds
Model Architecture
AIPAMPDS uses state-of-the-art deep learning models based on transformer architectures
and recurrent neural networks, trained on large-scale antimicrobial peptide databases.
Threshold Selection
Model thresholds are optimized to balance sensitivity and specificity:
- High thresholds (e.g., 0.998) ensure high confidence predictions
- Validated on experimental data from peer-reviewed publications
- Minimize false positives to reduce wasted experimental effort
Training Data
AMP Identifier:
- Trained on DBAASP, APD3, CAMP databases
- ~10,000+ validated AMP sequences
- Negative samples from UniProt
Activity Classifiers:
- Trained on experimental antimicrobial activity data
- E.coli and S.aureus specific models
- Binary classification: active vs inactive against specific bacteria
Hemolysis Predictor:
- Trained on hemolytic activity assays
- Critical for safety assessment
📊 Model Performance Metrics:
Our models achieve high accuracy on held-out test sets, with AUROC values > 0.95
for all classification tasks. However, computational predictions should always be
validated experimentally before clinical use.
❓ 6. Frequently Asked Questions
Q: How long are my results stored?
Results are stored on our servers for 30 days. After this period, they may be
automatically deleted to free up space. We recommend downloading your results
promptly after job completion.
Q: Can I use the generated sequences commercially?
Yes, you can use the generated sequences for research and commercial purposes.
However, we recommend:
- Experimental validation before any application
- Patent searches to ensure novelty
- Consulting with legal experts for IP protection
Q: What if my job fails?
Job failures can occur due to:
- Invalid input format (check FASTA formatting)
- Server issues (temporary - try again later)
- Sequences too long (>100 amino acids may cause issues)
If problems persist, contact us via the Contact page with your Job ID.
Q: How accurate are the predictions?
Our models achieve >95% accuracy on validation datasets. However:
- Predictions are computational estimates, not guarantees
- Experimental validation is essential
- Novel sequences may behave differently than training data
- Always test in lab conditions before applications
Q: Can I generate very long peptides?
Our generation models are optimized for peptides of 10-40 amino acids.
While longer sequences can be screened, generation quality may decrease
for very long peptides (>50 amino acids).
Q: What does "Broad-spectrum" mean?
Broad-spectrum antimicrobials are active against both Gram-positive and
Gram-negative bacteria. This is valuable for treating infections with
unknown or mixed bacterial populations.
Q: Why are some scores very close to the threshold?
Scores near the threshold (e.g., 0.9979 vs threshold 0.998) indicate
borderline cases. These peptides may have marginal activity and should
be interpreted with caution. We recommend focusing on peptides with
scores significantly above thresholds.
Q: Can I request new features or models?
Yes! We welcome feedback and feature requests. Please use the Contact
page to suggest new pathogens, additional properties to predict, or
other enhancements.
📧 Still have questions? Contact us through the
Contact page
and we'll be happy to help!
🚀 Quick Reference Card
Generation
✓ Select target
✓ Set number of sequences
✓ Submit & track with Job ID
Screening
✓ Prepare FASTA format
✓ Upload or paste sequences
✓ Results sorted by score
Job Status
✓ Enter Job ID
✓ Check progress
✓ Download results (CSV)