國家衛生研究院 NHRI:Item 3990099045/16034
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 12145/12927 (94%)
Visitors : 851339      Online Users : 706
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: http://ir.nhri.org.tw/handle/3990099045/16034


    Title: A novel multitask learning algorithm for tasks with distinct chemical space: zebrafish toxicity prediction as an example
    Authors: Lin, RH;Lin, PP;Wang, CC;Tung, CW
    Contributors: Institute of Biotechnology and Pharmaceutical Research;National Institute of Environmental Health Sciences
    Abstract: Data scarcity is one of the most critical issues impeding the development of prediction models for chemical effects. Multitask learning algorithms leveraging knowledge from relevant tasks showed potential for dealing with tasks with limited data. However, current multitask methods mainly focus on learning from datasets whose task labels are available for most of the training samples. Since datasets were generated for different purposes with distinct chemical spaces, the conventional multitask learning methods may not be suitable. This study presents a novel multitask learning method MTForestNet that can deal with data scarcity problems and learn from tasks with distinct chemical space. The MTForestNet consists of nodes of random forest classifiers organized in the form of a progressive network, where each node represents a random forest model learned from a specific task. To demonstrate the effectiveness of the MTForestNet, 48 zebrafish toxicity datasets were collected and utilized as an example. Among them, two tasks are very different from other tasks with only 1.3% common chemicals shared with other tasks. In an independent test, MTForestNet with a high area under the receiver operating characteristic curve (AUC) value of 0.911 provided superior performance over compared single-task and multitask methods. The overall toxicity derived from the developed models of zebrafish toxicity is well correlated with the experimentally determined overall toxicity. In addition, the outputs from the developed models of zebrafish toxicity can be utilized as features to boost the prediction of developmental toxicity. The developed models are effective for predicting zebrafish toxicity and the proposed MTForestNet is expected to be useful for tasks with distinct chemical space that can be applied in other tasks.Scieific contributionA novel multitask learning algorithm MTForestNet was proposed to address the challenges of developing models using datasets with distinct chemical space that is a common issue of cheminformatics tasks. As an example, zebrafish toxicity prediction models were developed using the proposed MTForestNet which provide superior performance over conventional single-task and multitask learning methods. In addition, the developed zebrafish toxicity prediction models can reduce animal testing.
    Date: 2024-08-02
    Relation: Journal of Cheminformatics. 2024 Aug 02;16:Article number 91.
    Link to: http://dx.doi.org/10.1186/s13321-024-00891-4
    JIF/Ranking 2023: http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=NHRI&SrcApp=NHRI_IR&KeyISSN=1758-2946&DestApp=IC2JCR
    Cited Times(WOS): https://www.webofscience.com/wos/woscc/full-record/WOS:001282761800001
    Cited Times(Scopus): https://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85200227651
    Appears in Collections:[Chun-Wei Tung] Periodical Articles
    [Pinpin Lin] Periodical Articles

    Files in This Item:

    File Description SizeFormat
    ISI001282761800001.pdf1393KbAdobe PDF40View/Open


    All items in NHRI are protected by copyright, with all rights reserved.

    Related Items in TAIR

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback