The QTL data and related information for bovine QTL are gathered from published work and from existing databases such as INRA BOVMAP [3
] and USDA-MARC[1
]. We elected to use the USDA-MARC map as our reference map and to display all QTL on the MARC map. This was done as a matter of convenience, as the MARC map is the most comprehensive bovine linkage map available. Every QTL mapping experiment results in the generation of a new map within which the QTL are localized. In order to represent QTL from different experiments on a single map, we needed to use a map that incorporated the markers used to define QTL. The MARC map fulfilled this criterion. In a small number of cases, markers have been used to map QTL and are not present on the MARC map. In these cases, the closest USDA marker that flanked the QTL marker were used as a proxy. An integrated database schema was designed and the database (MySQL) populated with the gathered data. Quarterly literature searches will allow frequent updating of the database. The bovine QTL Viewer was developed for the integration of QTL data available for cattle. The tool consists of an integrated database of bovine QTL and the QTL viewer to display QTL and their chromosomal position. As shown in Fig. , the database mainly consists of QTL data and Marker data.
Figure 1 QTL viewer description. This is a diagrammatic view describing the inputs and outputs from the database and viewer. QTL and marker information are assembled from the literature or web sites and entered into the linked, discrete marker and QTL databases. (more ...)
The database is independent of the viewer, which dynamically displays the query results (using the USDA MARC map as a the reference map) as graphical representations of chromosomes highlighted in the QTL regions (Fig. ). Because the database is independent from the viewer it is a simple matter to add databases for other species making this a more versatile tool. The interface was designed after initially canvassing potential users and two rounds of feedback to ensure user desired features were incorporated into the design.
Figure 2 Genome view of multiple QTL. This screen shot shows the result of a query for QTL for carcass quality, carcass yield, milk yield and milk fat traits. The QTL regions are color coded and for ease of viewing multiple, overlapping QTL for any single trait (more ...)
The user can then select a particular chromosome to view in detail by clicking on it. This chromosome view provides a detailed view of all QTL reported in the genome view generated previously. The user can go directly to the chromosome view by selecting a single chromosome at the time of the initial search. Individual QTL can then be viewed in detail (Fig. ) and at this point the user can determine statistical significance, family structure, literature references and markers included within the QTL region. A future link out to a genome browser will be available from this QTL detail view.
Figure 3 Detailed view of QTL. The critical interval of the QTL is shown along with peak marker location. QTL summary data such as QTL ID, statistical signficance, family structure and statistical methods are presented. Links to the USDA marker database are provided, (more ...)
As can be seen from the Entity-Relationship (ER) diagram (Fig. ), links to genome data can be provided by adding a relationship linking markers to their genome coordinates in the Gbrowse database.
Figure 4 Entity-Relationship diagram of the integrated database. The integrated database consists of two major, linked divisions, the marker database, which is independent from the QTL database. This allows easy updating of either component of the database or (more ...)
The ER diagram consists of these entities:
• QTL information, including all the details of each QTL
• Marker information, including all the details of each marker
• References, including the complete list of references
• Trait information, including the details of the traits of the cattle
• Category information, which gives details about the categories into which traits are divided
• Chromosome information, which has the length of each of chromosome measured in linkage units (Morgans).
The database is modeled such that each trait can fall under multiple categories. In addition, each QTL and marker record can have more than one reference. Finally, the QTL information and marker information are not linked directly. These data are linked dynamically by the QTL Viewer based on the position of the marker and the position of the QTL on the chromosome. This provides flexibility when the database is updated, allowing marker information and QTL information to be updated independently of each other.
Data can be added or edited using a set of web based tools that simplify database administration. Database administrators can add single QTL or easily modify QTL attributes without resorting to the command line. This makes updating the database possible by less skilled individuals such as students or technicians who may not have the training to mine the data out of the literature or have admin access to MySQL. Security settings are such that it is possible to assign users limited admin privileges over their own data, enabling them to submit and edit data but not delete QTL entries. User submitted data can be kept private and not accessible to other users until a user is ready to divulge the data for public access. This feature has been implemented to allow users to directly submit data prior to publication, allowing them to view their data in the context of public data in a genomic context.