DNA sequence data is the most abundant material with which to begin a project in computational biology. Raw genome sequences have to be analyzed and annotated, and these analyses improve continuously as the databases expand and sharper methods become available.