Using the same unknown sequence provided at the bottom of this document, perform
ID: 211573 • Letter: U
Question
Using the same unknown sequence provided at the bottom of this document, perform a BLASTX search using “Database” option set to "Reference proteins (refseq_protein)" and “Organism” set to “vertebrates (taxid:7742):
Keep all other settings and parameters at the default value.
2.a. How many Blast Hits do you get for the sequence?
Answer =
2.b. What non-PREDICTED hit is the sequence most similar to? Provide the Sequence Accession as your answer.
Answer =
2.c. What is the source organism for this hit?
Answer =
2.d What is the E-value for this hit?
Answer =
2.e. What is the Max identity for this hit?
Answer =
2.f. When you inspect the alignment for this hit, what is the Length of the amino acid sequence hit?
Answer =
2.g. When you inspect the alignment for this hit, what is the Frame the query sequence was translated in?
Answer =
2.h. When you inspect the alignment for this hit, what positions of the translated “Query” sequence align to the “Subject” (i.e., hit) sequence?
Answer =
2.i. When you inspect the alignment for this hit, what positions of the “Subject” (i.e., hit) sequence are aligned to by the translated “Query” sequence?
Answer =
2.j. When you inspect the alignment for this hit, what is the % Gap listed?
Answer =
2.k. When you inspect alignments for the other hits returned from the BLASTX search, is the translation Frame consistently the same as your answer to 2.g. or different?
Answer =
2.l. From your answer to 2.k., what then is the most likely translation Frame for the unknown query sequence that produces a functional protein?
Answer =
2.m. When you inspect the BLASTX search results from your answer to 2.a., what is the most likely protein that the query sequence produces?
Answer =
>Unknown_Sequence
GATATTCACATGATCGCAAATGACCCGCGGCTCCGGCGGCAGCGTCTCAAGCGCCCTCCGCCCGCTCGCAGCAGCAGTAAGTGTGCACGACTGTGCAAGTGTGAGAGCATCATGGTAGCATTCAAAGGAGTCTGGACTCATCCCTTCTGGAAAGCCGTTTCAGCAGAATTTTTGGTCATGTTGATTTTTGTCCTCCTCAGCCTTGGCTCTACGATCAACTGGGGTGGATCAGAGAAGCCACTGCCCGTAGACATGGTCCTTATCTCTCTCTGCTTTGGACTGAGCATTGCAACCATGGTGCAGTGCTTTGGACACATCAGCGGTGGCCACATCAACCCTGCTGTGACTGTGGCCATGGTCTGCACAAGAAAGATCAGCCTCGCCAAATCGGTCTTCTACATTCTTGCCCAGTGCCTGGGAGCCATCGTGGGAGCTGGCATCCTCTACCTCATCACACCACCGAGTGTGGTGGGAGGCCTGGGAGTCACTGCGGTACACGGGGATCTTTCCGCTGGCCATGGACTCCTGGTGGAGCTGATAATTACATTTCAGCTGGTTTTTACTATTTTTGCCAGCTGTGATTCAAAACGAAGTGATGTCACTGGTTCAGTAGCTCTAGCAATTGGATTTTCTGTTGCAATTGGACATTTATTTGCTATCAATTACACTGGTGCCAGTATGAACCCTGCTCGCTCATTTGGACCTGCTGTCATCATGGGAAAATGGGAAAACCAATGGGTTTATTGGGTGGGGCCAATAATAGGAGCAGTCCTTGCTGGTGCTCTTTATGAGTATGTCTATTGCCCAGACGTGGAGCTCAAACGCCGTTTTAAAGATGTCTTCAGTAAAGCTACTCAGCCATCCAAAGGGAAGTACATAGAAGTGGATGACACCAGGAGCCACGTAGAGACCGATGACCTGATCCTGAAGCCTGGCATTGTCCACGTGATTGATATTGACAGGAGTGAGGACAAGAAGGGAAGAGATCCATCAAGTGAGGTGCTGTCTTCTGTATGACTAGCAAGGAGCACTGAAAGCAGAGAGCAGCCTGCCAGCGACTCCACAGATATCCTTCCACCTATCAAACAGAGAGCAGCCTGCCAGCGACTCCACAGATATCCTTCCACCTATCAAAGAAACAGATCTCCTCTAAACAGAGCATCTATCATTTCTTAAAAAGTGTGGTGAAGGCAGCTGTGTGGTAGTGGCATCACCAAACCATACATCTGCTCAGCTGGAATATTAGGACTTCATTATAATTAGGATTCCCCACGAATTATTCTAAATTTGGAGGTGTTCCTGCAATTTTCCTTCCTTTCTTTCTGGAACAACCCCAAAGTCAAAAAGAGATGAAAGCACTCTTCTTTAATAAATCAGTCAATAATGAGATGAAGATAGAGCTGTTTAACATTCAGATTGACAGATAAGATGTATCAGGAAATGCCTATAGACATGAAGACCTACTTATCAGATTGTTCTTCTGACACTTAATTGACTGTTGTCATCTTTGATTAGAACATCTTATCCATTAAGCATTCTCTGTGAGGTTCAGGGACAGCACCAACAGTATTTAACAGTTTTATCCAAAGTCAAGCAATGGAGTATTGTTTCCACTTCACGTATGTCACTACTACCTTGCAAACTACCTCTGAAATAAATGATATTTTAATAGGCTCCAGAAAAAAATTCGATCAACCCATCAAATTTCACTCATACGATTTTCTGTATAAATGTATTACCTTCATCTCTTCCCAGGAGTAAATATGCTGAAATTGAATATTGAAGTCTACCGTAATAAGGCTGCAAAGCATTTAACTGTACTTCATGCTCACTTTTGACATTGTCTATCTGGTGAAACATTCTCCTGGGGTTTGACTATTGACCATTTCATGTTAGCAGACTCTCAAGGTCACG
Choose Search Set Database Reference proteins (refseq-protein) Organisnm Optional Exclude + vertebrates (taxid:7742)Explanation / Answer
2a. 100 hits were obtained when the above sequence was blasted.
2b. non-predicted hits cannot be commented upon as the sequence is similar to aquaporin-4 isoform 1 of many organisms.
2c. The source organism for the hit is Gallus gallus.
2d. E value for this hit is 0.0
2e. Max score for this protein is 640.
2f. length of amino acid sequence is 335.
Related Questions
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.