National Health Research Institutes (NHRI): Item 3990099045/16034
    Please use this permanent URL to cite or link to this item: http://ir.nhri.org.tw/handle/3990099045/16034


    Title: A novel multitask learning algorithm for tasks with distinct chemical space: zebrafish toxicity prediction as an example
    Authors: Lin, RH; Lin, PP; Wang, CC; Tung, CW
    Contributors: Institute of Biotechnology and Pharmaceutical Research; National Institute of Environmental Health Sciences
    Abstract: Data scarcity is one of the most critical issues impeding the development of prediction models for chemical effects. Multitask learning algorithms leveraging knowledge from relevant tasks showed potential for dealing with tasks with limited data. However, current multitask methods mainly focus on learning from datasets whose task labels are available for most of the training samples. Since datasets were generated for different purposes with distinct chemical spaces, the conventional multitask learning methods may not be suitable. This study presents a novel multitask learning method, MTForestNet, that can deal with data scarcity problems and learn from tasks with distinct chemical space. The MTForestNet consists of nodes of random forest classifiers organized in the form of a progressive network, where each node represents a random forest model learned from a specific task. To demonstrate the effectiveness of the MTForestNet, 48 zebrafish toxicity datasets were collected and utilized as an example. Among them, two tasks are very different from the other tasks, sharing only 1.3% of their chemicals with other tasks. In an independent test, MTForestNet, with a high area under the receiver operating characteristic curve (AUC) value of 0.911, provided superior performance over the compared single-task and multitask methods. The overall toxicity derived from the developed models of zebrafish toxicity is well correlated with the experimentally determined overall toxicity. In addition, the outputs from the developed models of zebrafish toxicity can be utilized as features to boost the prediction of developmental toxicity.
The developed models are effective for predicting zebrafish toxicity, and the proposed MTForestNet is expected to be useful for other tasks with distinct chemical space.
Scientific contribution: A novel multitask learning algorithm, MTForestNet, was proposed to address the challenges of developing models using datasets with distinct chemical space, which is a common issue in cheminformatics tasks. As an example, zebrafish toxicity prediction models were developed using the proposed MTForestNet, which provide superior performance over conventional single-task and multitask learning methods. In addition, the developed zebrafish toxicity prediction models can reduce animal testing.
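
The abstract notes that two tasks share only 1.3% of their chemicals with other tasks. The record does not say how that figure was computed, so the following is only a minimal sketch under one assumed definition: for each task, the fraction of its chemicals that also appear in at least one other task. The function name `shared_fraction`, the task names, and the toy identifiers are hypothetical and do not come from the paper.

```python
from typing import Dict, Set


def shared_fraction(tasks: Dict[str, Set[str]]) -> Dict[str, float]:
    """For each task, return the fraction of its chemicals found in any other task.

    Assumed definition for illustration only; the paper may measure overlap differently.
    """
    result = {}
    for name, chemicals in tasks.items():
        # Union of the chemicals belonging to every other task.
        others: Set[str] = set()
        for other_name, other_chemicals in tasks.items():
            if other_name != name:
                others |= other_chemicals
        # Guard against empty tasks to avoid division by zero.
        result[name] = len(chemicals & others) / len(chemicals) if chemicals else 0.0
    return result


# Hypothetical toy data: the strings stand in for canonical chemical identifiers.
toy_tasks = {
    "task_a": {"c1", "c2", "c3", "c4"},
    "task_b": {"c2", "c3", "c4", "c5"},
    "task_c": {"c4", "c9", "c10", "c11"},
}
fractions = shared_fraction(toy_tasks)
```

In this toy data, `task_c` shares far fewer chemicals with the other tasks than `task_a` or `task_b` do, which is the kind of situation the abstract describes as a distinct chemical space. Multitask methods that need labels for most training samples across tasks have little to work with there.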
    Date: 2024-08-02
    Relation: Journal of Cheminformatics. 2024 Aug 02;16:Article number 91.
    Link to: http://dx.doi.org/10.1186/s13321-024-00891-4
    JIF/Ranking 2023: http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=NHRI&SrcApp=NHRI_IR&KeyISSN=1758-2946&DestApp=IC2JCR
    Cited Times(WOS): https://www.webofscience.com/wos/woscc/full-record/WOS:001282761800001
    Cited Times(Scopus): https://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85200227651
    Appears in collections: [童俊維] Journal Articles
    [林嬪嬪] Journal Articles

    Files in this item:

    File                    Description  Size    Format     Views
    ISI001282761800001.pdf               1393Kb  Adobe PDF  40


    All items in NHRI are protected by the original copyright.
