YEARS

2007-2009

AUTHORS

Degui Zhi

TITLE

Algorithms for protein superfamily classification and function prediction

ABSTRACT

DESCRIPTION (provided by applicant): I am a postdoctoral scholar associated with Steven Brenner's lab at Berkeley, working on structural biology and computational genomics. My long-term vision is to develop new algorithms for inferring protein evolution and function from sequence and structure. Currently I am working on algorithms that can automatically classify a protein into its proper superfamily. The long-term goal of this project is to improve the accuracy of protein structure classification and function prediction. The superfamily defines ancient protein homology. Protein superfamily classification remains a challenging task, even when 3D structure is available. Currently this task still requires experts' manual work. We believe that the classification of protein superfamilies relies on the integration of sequence information and structure information. We will employ recent breakthroughs in kernel-based machine learning approaches for combining different sources of information. We will also develop structure-based discriminative profile models for protein superfamilies. We expect these algorithmic developments will not only result in a practical tool for superfamily classification, but they will also improve our understanding of the interplay of sequence and structure on defining very remote homology. We will extend our structure-based discriminative profile models for protein classification to function prediction. We will develop new methods for the identification of structure-sequence signatures of protein functioin. In addition, we will extend the graph theoretical models for multiple sequence alignment I developed during my Ph.D. study to meet the challenge of domain annotation for large new sequence set. The advancement of medical research is partly based on our detailed understanding of the functions of genes and proteins. My research will improve our understanding of protein evolution and function at the molecular level. Our computational approach will speed up the discovery of biological knowledge from large data sets generated by high-throughput methods.

FUNDED PUBLICATIONS

  • Alignment-free local structural search by writhe decomposition.
  • How to use: Click on a object to move its position. Double click to open its homepage. Right click to preview its contents.

    Download the RDF metadata as:   json-ld nt turtle xml License info


    20 TRIPLES      17 PREDICATES      21 URIs      9 LITERALS

    Subject Predicate Object
    1 grants:8f92ad5d61e59a6a78952dd7a6fab025 sg:abstract DESCRIPTION (provided by applicant): I am a postdoctoral scholar associated with Steven Brenner's lab at Berkeley, working on structural biology and computational genomics. My long-term vision is to develop new algorithms for inferring protein evolution and function from sequence and structure. Currently I am working on algorithms that can automatically classify a protein into its proper superfamily. The long-term goal of this project is to improve the accuracy of protein structure classification and function prediction. The superfamily defines ancient protein homology. Protein superfamily classification remains a challenging task, even when 3D structure is available. Currently this task still requires experts' manual work. We believe that the classification of protein superfamilies relies on the integration of sequence information and structure information. We will employ recent breakthroughs in kernel-based machine learning approaches for combining different sources of information. We will also develop structure-based discriminative profile models for protein superfamilies. We expect these algorithmic developments will not only result in a practical tool for superfamily classification, but they will also improve our understanding of the interplay of sequence and structure on defining very remote homology. We will extend our structure-based discriminative profile models for protein classification to function prediction. We will develop new methods for the identification of structure-sequence signatures of protein functioin. In addition, we will extend the graph theoretical models for multiple sequence alignment I developed during my Ph.D. study to meet the challenge of domain annotation for large new sequence set. The advancement of medical research is partly based on our detailed understanding of the functions of genes and proteins. My research will improve our understanding of protein evolution and function at the molecular level. Our computational approach will speed up the discovery of biological knowledge from large data sets generated by high-throughput methods.
    2 sg:endYear 2009
    3 sg:fundingAmount 146880.0
    4 sg:fundingCurrency USD
    5 sg:hasContribution contributions:023adeb7cd042bc1d530683f1ac183a6
    6 sg:hasFieldOfResearchCode anzsrc-for:06
    7 anzsrc-for:0601
    8 anzsrc-for:08
    9 anzsrc-for:0801
    10 sg:hasFundedPublication articles:b928760063d0693c382caf11cdffce52
    11 sg:hasFundingOrganization grid-institutes:grid.429651.d
    12 sg:hasRecipientOrganization grid-institutes:grid.47840.3f
    13 sg:language English
    14 sg:license http://scigraph.springernature.com/explorer/license/
    15 sg:scigraphId 8f92ad5d61e59a6a78952dd7a6fab025
    16 sg:startYear 2007
    17 sg:title Algorithms for protein superfamily classification and function prediction
    18 sg:webpage http://projectreporter.nih.gov/project_info_description.cfm?aid=7495992
    19 rdf:type sg:Grant
    20 rdfs:label Grant: Algorithms for protein superfamily classification and function prediction
    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular JSON format for linked data.

    curl -H 'Accept: application/ld+json' 'http://scigraph.springernature.com/things/grants/8f92ad5d61e59a6a78952dd7a6fab025'

    N-Triples is a line-based linked data format ideal for batch operations .

    curl -H 'Accept: application/n-triples' 'http://scigraph.springernature.com/things/grants/8f92ad5d61e59a6a78952dd7a6fab025'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'http://scigraph.springernature.com/things/grants/8f92ad5d61e59a6a78952dd7a6fab025'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'http://scigraph.springernature.com/things/grants/8f92ad5d61e59a6a78952dd7a6fab025'






    Preview window. Press ESC to close (or click here)


    ...