🔍 BLASTP: Protein-Protein BLAST
BLASTP (Basic Local Alignment Search Tool for Proteins) is a fundamental bioinformatics tool used to compare a protein query sequence against a protein sequence database. It identifies regions of local similarity between sequences, helping to infer functional and evolutionary relationships.
❓ What is BLASTP?
BLASTP takes an amino acid (protein) query sequence and searches it against a chosen protein sequence database. It identifies regions of significant similarity, providing a list of hits (matching sequences) and their alignments. This is the most common BLAST program for protein sequence comparison.
- Protein Query vs. Protein Database: Compares protein to protein.
- Local Alignment: Finds regions of highest similarity.
- Homology Inference: Helps deduce function and evolutionary relationships.
🎯 Why Use BLASTP? For Protein Homology & Function Prediction
BLASTP is indispensable for:
- 🔍 Functional Annotation: Predicting the function of a novel protein by finding homologous proteins with known functions.
- 🧬 Homology Search: Identifying evolutionarily related proteins across different species.
- 📊 Protein Family Classification: Assigning a protein to a known family or superfamily.
- 🎯 Target Identification: Finding similar proteins that could be drug targets or have specific binding properties.
- 📈 Gene Discovery: Identifying new genes by searching translated genomic regions against protein databases.
🧑💻 How to Use BLASTP on Job Dispatcher: A Step-by-Step Guide
Follow these simple steps to perform a protein-protein BLAST search:
1️⃣ Navigate to the Tool
- From the main menu, go to All Tools (or search for "BLASTP").
- Click the prominent Use Tool button located next to "BLASTP."
2️⃣ Input Your Protein Sequence
Locate the input box (large text area) or the "upload a Sequence File" option.
Paste your protein sequence(s) in FASTA format or upload a FASTA file.
>my_protein_query MGDVEKGKKIFIMKCSQCHTVEKGGKHKTGPNLHGLFGRKTGQAPGYSYTAANKNKGIIWGEDTLMEYLENPKKYIPGTKMIFVGIKKKEERADLIAYLKKATNEImportant: You can provide a sequence either by typing into the text area OR by uploading a file, but not both simultaneously. Please clear one input to proceed.
3️⃣ Configure Parameters
📝 Title: Provide a descriptive title for your job (e.g., "My BLASTP Search").
💡 Sequence Type: (Automatically set to Protein for BLASTP).
🗄️ Databases: Select one or more protein databases to search against.
- Default:
uniprotkb_swissprot - (Many other options available in the Protein Databases Tree on the form)
- Default:
📝 INCL. TAXONOMY IDs (
taxids): Enter taxonomy IDs separated by commas (e.g., 9606, 10090, 7227).📝 EXCL. TAXONOMY IDs (
negative_taxids): Enter taxonomy IDs separated by commas (e.g., 9606, 10090, 7227).📊 Matrix (
matrix): Select the scoring matrix to use for protein alignments.BLOSUM62- DefaultBLOSUM45,BLOSUM50,BLOSUM80,BLOSUM90PAM30,PAM70,PAM250NONE(M10)
➖ Gap Open (
gapopen): The penalty for opening a new gap.- Default:
11 - Options:
-1,0,1, ...,25
- Default:
➖ Gap Extend (
gapext): The penalty for extending an existing gap.- Default:
1 - Options:
-1,0,1, ...,10
- Default:
📉 EXP.THR (
exp): The Expectation Value (E-value) threshold. Matches with E-values higher than this will not be reported. Lower values are stricter.- Default:
10 - Options:
1e-200,1e-100,1e-50,1e-10,1e-5,1e-4,0.001,0.01,0.1,1.0,10,100,1000,20,50
- Default:
🧹 FILTER (
filter): Apply a low-complexity filter (e.g., SEG filter for protein).no(F) - Defaultyes(T)
🗑️ DROPOFF (
dropoff): Dropoff value for the Gapped BLAST algorithm.- Default:
0 - Options:
2,4,6,8,10
- Default:
🔢 SCORES (
scores): Maximum number of scores to report.- Default:
50 - Options:
0,5,10,20,50,100,150,200,250,500,750,1000
- Default:
↔️ ALIGNMENTS (
alignments): Maximum number of alignments to report.- Default:
50 - Options:
0,5,10,20,50,100,150,200,250,500,750,1000
- Default:
📏 SEQUENCE RANGE (
seqrange): Define a specific range within the query sequence to search.- Default:
START-END(entire sequence)
- Default:
🔢 HSPS (
hsps): Maximum number of High-scoring Segment Pairs (HSPs) to report.- Default:
100 - Input type: Number
- Default:
↔️ GAPALIGN (
gapalign): Perform gapped alignments.true- Defaultfalse
👁️ ALIGN VIEWS (
align): Choose the format for displaying alignments.0(pairwise) - Default1(Query-anchored identities)2(Query-anchored non-identities)3(Flat query-anchored identities)4(Flat query-anchored non-identities)5(BLASTXML)6(Tabular)7(Tabular with comment lines)8(Text ASN.1)9(Binary ASN.1)10(Comma-separated values)11(BLAST archive format (ASN.1))12(Tabular with comment lines with btop)
📊 COMPOSITION-BASED (
compstats): Use composition-based statistics.F- DefaultD,1,2,3
📏 WORD SIZE (
wordsize): The length of the initial exact match (seed) required to initiate an alignment.- Default:
6 - Input type: Number
- Default:
4️⃣ Submit Your Job
- Once your sequence is entered and parameters are set, click the Submit or Run button.
- Your job will be dispatched to the EMBL-EBI Web Service. You will be automatically redirected to a Job Status page to monitor its progress.
5️⃣ Interpret Results
- On the results page, you will find a summary of your BLASTP search, including a graphical overview of matches, a table of significant alignments, and detailed pairwise alignments.
- Pay attention to the E-value (Expectation Value), which indicates the number of hits you would expect to see by chance. Lower E-values mean more significant matches.
- ⭐ Tip: Explore the detailed alignments to understand the exact matching regions and any gaps or mismatches.
💬 Need Help?
If you run into issues, please visit our Contact Us page for support. Happy BLASTing!