Biocomputation Core

$324,823 | P01 | FY2016 | AI | NIH

UT Southwestern Medical Center, Dallas, TX

Investigators

Linked publications & trials

Paper 34330981 Paper 31072819 Paper 30530643 Paper 30119996 Paper 28975893 Paper 28930687 Paper 28522639 Paper 28369485 Paper 28213497 Paper 28102430 Paper 28002407 Paper 27890263 Paper 27708159 Paper 27657660 Paper 27232381 Paper 27009948 Paper 26739560 Paper 26040701 Paper 26024431 Paper 26011644 Paper 25758257 Paper 25658360 Paper 25605905 Paper 25525240 Paper 25416947 Paper 25253354 Paper 24947515 Paper 24907422 Paper 24813206 Paper 24706898 Paper 24335288 Paper 24089564 Paper 23980096 Paper 23836649 Paper 23749869 Paper 23706741 Paper 23601685 Paper 23515163 Paper 23502856 Paper 23500036 Paper 23388631 Paper 23382217 Paper 23255357 Paper 22817992 Paper 22721918 Paper 22442689 Paper 22440911 Paper 22114338 Paper 22106289 Paper 21719711 Paper 21551232 Paper 21525388 Paper 21511177 Paper 21423172 Paper 21402360 Paper 21148033 Paper 21045126 Paper 20978209 Paper 20951135 Paper 20923982 Paper 20876105 Paper 20729857 Paper 20457904 Paper 20404851 Paper 20346772 Paper 20303872 Paper 20190759 Paper 20190135 Paper 20133626 Paper 20093118 Paper 20080593 Paper 20007575 Paper 19926846 Paper 19923465 Paper 19847289 Paper 19668221 Paper 19635904 Paper 19322177 Paper 19223163 Paper 19202056 Paper 19120489 Paper 19120477 Paper 18953338 Paper 18794526 Paper 18591409 Paper 18272355 Paper 18262306 Paper 18066067 Paper 17979849 Paper 17893693 Paper 17579639 Paper 17303405

Abstract

A flexible facility for genetic computation, including both ongoing software development and expansion of hardware resources, is essential for high-throughput sequencing operations to succeed. As the technology is new and still evolving rapidly, different platforms are introduced two to three times per year, each requiring some adjustment in order to harness the data pipeline. Stable artifacts must be monitored and recorded in order to eliminate most false calls; annotated coding regions and splice junctions must be updated periodically; putative discrepancies that change coding sense must be distinguished from those that do not; and finally all calls must be validated or excluded from consideration. Once authentic causative or incidental mutations are established, the data must be ported to a permanent repository without the introduction of error by human operators. This repository, built during the first period of funding, is Mutagenetix. A parallel repository will be established for Drosophila mutations. All of these tasks will fall within the purview of Core E. In addition, Core E will model mutagenesis to allow optimized use of ENU in somatic cell mutagenesis studies, to be carried out in Project 1 and in future projects. Core E will be supported by hardware that includes two large (~3,000-node) Linux cluster computers at the Texas Advanced Computing Center (TACC), a smaller dedicated cluster devoted solely to mapping short reads, and servers that support Mutagenetix (now accessed approximately 3,000 times per month by approximately 1,000 independent users worldwide).
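
The triage steps named in the abstract (excluding recorded artifacts, then separating calls that change coding sense from those that do not) can be illustrated with a minimal sketch. This is not the Core's actual pipeline. The function names, the (site, reference codon, variant codon) call format, and the artifact set are all hypothetical, and the sketch assumes the standard genetic code and single-codon substitutions on the coding strand.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_codon_change(ref_codon, alt_codon):
    """Label a codon substitution by its effect on coding sense."""
    ref_aa = CODON_TABLE[ref_codon.upper()]
    alt_aa = CODON_TABLE[alt_codon.upper()]
    if ref_aa == alt_aa:
        return "synonymous"   # coding sense unchanged
    if alt_aa == "*":
        return "nonsense"     # premature stop codon introduced
    if ref_aa == "*":
        return "stop_lost"    # stop codon replaced by an amino acid
    return "missense"         # one amino acid replaced by another


def triage_calls(calls, known_artifacts):
    """Drop recorded artifact sites, then keep only sense-changing calls.

    calls: iterable of (site, ref_codon, alt_codon) tuples (hypothetical format).
    known_artifacts: set of site identifiers recorded as recurrent false calls.
    Returns a list of (site, effect) pairs that still need validation.
    """
    candidates = []
    for site, ref_codon, alt_codon in calls:
        if site in known_artifacts:
            continue  # recorded artifact: excluded as a likely false call
        effect = classify_codon_change(ref_codon, alt_codon)
        if effect != "synonymous":
            candidates.append((site, effect))
    return candidates
```

For example, with the invented calls ("chr1:100", "GAA", "GAG") and ("chr1:200", "GAA", "GTA"), the first is synonymous (Glu to Glu) and is set aside, while the second is missense (Glu to Val) and is retained as a candidate for validation.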
