As a part of the Human Genome Project, large scale sequencing of cDNA clones from various tissues have been performed and many cDNA sequences have been stored in the public databases. From a lot of sequence data, to obtain more useful biological information, it is indispensable to refine and classify them [1]. We developed a prototype system for refinement and classification of many sequence data.