|
|
|
Would you like to explore and research
the latest scientific ideas?
Do you enjoy mathematics and/or science?
If so, have you thought about joining the University of Queenslands
elite Advanced Studies Program (ASP)? One of the highlights of the ASP is
the opportunity to talk with and learn from researchers working at the frontiers
of science. In this program second year students are placed with leading research
teams and actively participate in research projects sounds exciting,
doesnt it!
You can find out more general information about the ASP on page 4. Below we
have a brief description of some of the mathematical research projects current
ASP students Jabin, Tracy and Marissa, are involved in.
Jabin Kirk, working with Diane Donovan, is looking for evidence of hidden
codes within protein-coding genes.
Cells store the instructions for the production of proteins in fragments of
DNA/RNA, known as genes. More precisely, RNA is made up of sequences of nucleotides,
G, A, U and C, and amino acids are subsequences of RNA of length three; that
is amino acids are encoded by triplets of nucleotides. To produce a protein,
cells locate specific regions or strands of RNA. The cell reads the nucleotides
on the strand in groups of three, with each such triplet identifying an amino
acid needed for the production of the protein. An interesting aspect of this
process is the fact that there are essentially only 20 different types of
amino acids, but 64 = 43 = 4 x 4 x 4 different triplets (as there
are four different nucleotides for each of the three positions of a triplet).
So two or more triplets are often used to encode the same amino acid. The
assignment of the 64 triplets to different types of amino acids is set out
in the table above and is known as the universal genetic code. Scientists,
including Jabin, are eager to know if this redundancy follows a strict pattern
and whether through evolution the universal genetic code has developed (mathematical)
coding theory properties, which minimizes the impact of mutations. A mutation
in a DNA/RNA strand occurs when one nucleotide is replaced by another, and
may result in the incorrect production of proteins because the amino acid
is changed.
Universal Genetic Code
| UUU (phe) |
UCU (ser) |
UAU (tyr) |
UGU (cys) |
| UUC (phe) |
UCC (ser) |
UAC (tyr) |
UGC (cys) |
| UUA (leu) |
UCA (ser) |
UAA (stop) |
UGA (stop) |
| UUG (leu) |
UCG (ser) |
UAG (stop) |
UGG (trp) |
| CUU (leu) |
CCU (pro) |
CAU (his) |
CGU (arg) |
| CUC (leu) |
CCC (pro) |
CAC (his) |
CGC (arg) |
| CUA (leu) |
CCA (pro) |
CAA (gln) |
CGA (arg) |
| CUG (leu) |
CCG (pro) |
CAG (gln) |
CGG (arg) |
| AUU (ile) |
ACu (thr) |
AAU (asn) |
AGU (ser) |
| AUC (ile) |
ACC (thr) |
AAC (asn) |
AGC (ser) |
| AUA (ile) |
ACA (thr) |
AAA (lys) |
AGA (arg) |
| AUG (start) |
ACG (thr) |
AAG (lys) |
AGG (arg) |
| GUU (val) |
GCU (ala) |
GAU (asp) |
GGU (gyl) |
| GUC (val) |
GCC (ala) |
GAC (asp) |
GGC (gyl) |
| GUA (val) |
GCA (ala) |
GAA (glu) |
GGA (gyl) |
| GUG (val) |
GCG (ala) |
GAG (glu) |
GGG (gyl) |
** (the notation UAC (tyr)
implies that the triplet of nucleotides UAC codes for, or corresponds to,
the amino acid tyrosine.)
To study the universal genetic code we
can begin by studying a simpler code and then try to extend the knowledge
we gain to explain the more complicated code. So consider a binary code, over
just two symbols 0 and 1; that is, 2x2x2= 23 = 8 triplets, 000, 001, 010,
011, 100, 101, 110, 111. A single mutation in a triplet means a 0 will change
to a 1 or vice versa.

Jabin has represented
this concept of mutation by the mathematical diagram shown above. Jabin uses
dots to represent triplets, and two triplets are joined by a line if one triplet
can be obtained from the other by a single mutation. Now let a triplet say,
001, code for an amino acid. A single mutation in 001 gives 000, 101 or 011,
and so if a cell reads any one of the three triplets 000, 101 or 011 it should
recognize that they all correspond to the same amino acid as 001 and so should
decode back to 001. This leaves four triplets and if we choose
110 for a second amino acid, then a single mutation gives 010, 100 or 111,
and a cell should decode each of these words back to the same
amino acid as 110. In the diagram the big dots represent the code words and
if a triplet corresponding to a small dot is received it should be corrected
back to the closest big dot. If you study the diagram, you will find other
similar codes. (For instance, let 000 represent one amino acid;
then 111 should represent the other amino acid, and we leave you to apportion
the remaining triples to these amino acids, so that a cell can always correct
a single mutation.) However all these codes have three special
properties: exactly two amino acids, say X and Y, can be encoded; the triplets
corresponding to X and Y are separated by at least two dots; and the triplets
joined to X by a line also code for amino acid X and similarly for Y. This
is the best possible code for correcting single mutations. What is interesting
is that Jabins approach can be extended to studying codes based on four
different symbols G, A, U and C. However, his analysis shows that 64 triplets
is not enough to have the facility to correct single mutations for 20 code
words of length 3 and that the code words would need to be longer
than triplets. Jabins ideas differ from research published in the Journal
of Theoretical Biology, which suggests this modeling approach might provide
evidence of an underlying code. What is exciting about Jabins research
is that he has produced mathematical results which challenge us to question
existing theory. Hopefully Jabins ideas will lead to further discussion
and a more rigorous analysis of the structure of the universal genetic code.
Tracy Rout, a third
year student, has been working for the last two years with Hugh Possinghams
research team designing better marine ecology reserves.
In a world where economic and political agendas compete with conservation
goals, it is important to ensure that terrestrial and marine reserves are
designed to maximise and conserve biodiversity while minimising and compacting
the area to be maintained. Tracy has been using a computer program called
Marxan (created by Ian Ball and Hugh Possingham) to design marine reserve
systems for South Australia. Marxan uses simulated annealing to maximise the
ecological aspects while minimising the cost the weighted
sum of area and boundary length of the reserve system. So Tracy has
constructed cost functions based on the boundary length and area of the reserve
and then used mathematical techniques to find the minimum value for this function.
For zones currently being planned by the South Australian Department of Environment
and Heritage, Tracy has been studying and designing reserves on three different
scales: for the area as a whole; by bioregion; and by marine sections. By
comparing the efficiency, compactness and differences in representation between
these reserves, Tracy hopes to uncover the effect of planning at divergent
scales. She is currently writing up her results and hopes to have them published
in a paper.
Marissa McBride also
worked with Hugh Possinghams research team in 2003.
Her brief was to set up a demographic (age structured) population transition
matrix and use Markov chain and Monte Carlo methods to determine the optimal
parameters (eg birth/death/growth rates at each stage) for a model of a snail
population. Marissa then tested the performance of her model by comparing
it against data taken from field experiments. Marissa wrote a basic Matlab
program for a simplified version of the model, and generated the necessary
data. Her supervisor, Chris Wilcox, is currently collaborating with statisticians
and conducting a statistical analysis of Marissas results.

©
Discipline of Mathematics, The
University of Queensland.
|
|
|