🔍 BLASTN: Nucleotide-Nucleotide BLAST
BLASTN (Basic Local Alignment Search Tool for Nucleotides) is a fundamental bioinformatics tool used to compare a nucleotide (DNA or RNA) query sequence against a nucleotide sequence database. It identifies regions of local similarity between nucleic acid sequences, helping to infer functional and evolutionary relationships.
❓ What is BLASTN?
BLASTN takes a DNA or RNA query sequence and searches it directly against a chosen nucleotide sequence database. It identifies regions of significant similarity, providing a list of hits (matching sequences) and their alignments. This is the most common BLAST program for direct nucleotide sequence comparison.
- Nucleotide Query vs. Nucleotide Database: Compares nucleotide to nucleotide.
- Local Alignment: Finds regions of highest similarity.
- Homology Inference: Helps deduce function and evolutionary relationships for nucleic acids.
🎯 Why Use BLASTN? For DNA/RNA Homology & Gene Identification
BLASTN is indispensable for:
- 🔍 Gene Identification: Finding known genes or homologous sequences in a newly sequenced genome or transcript.
- 🧬 Primer/Probe Specificity: Checking the specificity of PCR primers or hybridization probes against a genome or transcript database.
- 📊 Sequence Verification: Confirming the identity of a cloned DNA fragment.
- 🎯 SNP/Variant Detection: Identifying single nucleotide polymorphisms (SNPs) or other small variants when comparing a query to a reference.
- 📈 Contamination Detection: Identifying contaminating sequences in your sample.
🧑💻 How to Use BLASTN on Job Dispatcher: A Step-by-Step Guide
Follow these simple steps to perform a nucleotide-nucleotide BLAST search:
1️⃣ Navigate to the Tool
- From the main menu, go to All Tools (or search for "BLASTN").
- Click the prominent Use Tool button located next to "BLASTN."
2️⃣ Input Your Nucleotide Sequence
Locate the input box (large text area) or the "upload a Sequence File" option.
Paste your nucleotide (DNA or RNA) sequence(s) in FASTA format or upload a FASTA file.
>my_dna_query ATGGCCATGGCACTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCATGImportant: You can provide a sequence either by typing into the text area OR by uploading a file, but not both simultaneously. Please clear one input to proceed.
3️⃣ Configure Parameters
📝 Title: Provide a descriptive title for your job (e.g., "My BLASTN Search").
💡 Sequence Type: (Automatically set to DNA for BLASTN, as it expects nucleotide input).
🗄️ Databases: Select one or more nucleotide databases to search against.
- Default:
em_est_env,em_gss_env,em_htc_env,em_htg_env,em_pat_env,em_std_env,em_sts_env,em_tsa_env - (Many other options available in the Nucleotide Databases Tree on the form)
- Default:
⚙️ TASK (
task): Choose the specific BLASTN task.blastn- Default (Standard nucleotide-nucleotide BLAST)megablast(Optimized for highly similar sequences, faster)
📝 INCL. TAXONOMY IDs (
taxids): Enter taxonomy IDs separated by commas (e.g., 9606, 10090, 7227).📝 EXCL. TAXONOMY IDs (
negative_taxids): Enter taxonomy IDs separated by commas (e.g., 9606, 10090, 7227).➕ MATCH/MISMATCH SCORES (
match_scores): Define scores for matches and mismatches.- Default:
1,-3 - Options:
1,-4,2,-7,1,-3,2,-5,1,-2,2,-3,1,-1,5,-4,4,-5
- Default:
➖ Gap Open (
gapopen): The penalty for opening a new gap.- Default:
5 - Options:
-1,0,1, ...,25
- Default:
➖ Gap Extend (
gapext): The penalty for extending an existing gap.- Default:
2 - Options:
-1,0,1, ...,10
- Default:
📉 EXP.THR (
exp): The Expectation Value (E-value) threshold. Matches with E-values higher than this will not be reported. Lower values are stricter.- Default:
10 - Options:
1e-200,1e-100,1e-50,1e-10,1e-5,1e-4,0.001,0.01,0.1,1.0,10,100,1000,20,50
- Default:
🧹 FILTER (
filter): Apply a low-complexity filter (e.g., DUST filter for nucleotide).yes(T) - Defaultno(F)
🗑️ DROPOFF (
dropoff): Dropoff value for the Gapped BLAST algorithm.- Default:
0 - Options:
2,4,6,8,10
- Default:
🔢 SCORES (
scores): Maximum number of scores to report.- Default:
50 - Options:
0,5,10,20,50,100,150,200,250,500,750,1000
- Default:
↔️ ALIGNMENTS (
alignments): Maximum number of alignments to report.- Default:
50 - Options:
0,5,10,20,50,100,150,200,250,500,750,1000
- Default:
📏 SEQUENCE RANGE (
seqrange): Define a specific range within the query sequence to search.- Default:
START-END(entire sequence)
- Default:
🔢 HSPS (
hsps): Maximum number of High-scoring Segment Pairs (HSPs) to report.- Default:
100 - Input type: Number
- Default:
↔️ GAPALIGN (
gapalign): Perform gapped alignments.true- Defaultfalse
👁️ ALIGN VIEWS (
align): Choose the format for displaying alignments.0(pairwise) - Default1(Query-anchored identities),2(Query-anchored non-identities),3(Flat query-anchored identities),4(Flat query-anchored non-identities)5(BLASTXML),6(Tabular),7(Tabular with comment lines),8(Text ASN.1),9(Binary ASN.1),10(Comma-separated values),11(BLAST archive format (ASN.1)),12(Tabular with comment lines with btop)
📏 WORD SIZE (
wordsize): The length of the initial exact match (seed) required to initiate an alignment.- Default:
11 - Input type: Number
- Default:
4️⃣ Submit Your Job
- Once your sequence is entered and parameters are set, click the Submit or Run button.
- Your job will be dispatched to the EMBL-EBI Web Service. You will be automatically redirected to a Job Status page to monitor its progress.
5️⃣ Interpret Results
- On the results page, you will find a summary of your BLASTN search, including a graphical overview of matches, a table of significant alignments, and detailed pairwise alignments.
- Pay attention to the E-value (Expectation Value), which indicates the number of hits you would expect to see by chance. Lower E-values mean more significant matches.
- ⭐ Tip: For very similar sequences, consider using the
megablasttask for faster results.
💬 Need Help?
If you run into issues, please visit our Contact Us page for support. Happy BLASTing!