科学研究

科学研究

学术讲座
当前位置是: 首页 -> 科学研究 -> 学术讲座 -> 正文

Positive and Unlabeled Data: Model, Estimation, Inference, and Classification

作者: 发布时间:2024-09-23 点击数:
主讲人:田庆隆
主讲人简介:

Qinglong Tian is an assistant professor at the department of statistics and actuarial science at the university of waterloo. He graduated from Renmin university of China with a BS in statistics in 2016 and from Iowa State University with a PhD in statistics in 2021. His current research interests include transfer learning, domain adaptation, and out-of-distribution detection.

主持人:王淳林
讲座简介:

This study introduces a new approach to addressing positive and unlabeled (PU) data through the double exponential tilting model (DETM). Traditional methods often fall short because they only apply to selected completely at random (SCAR) PU data, where the labeled positive and unlabeled positive data are assumed to be from the same distribution. In contrast, our DETM's dual structure effectively accommodates the more complex and underexplored selected at random PU data, where the labeled and unlabeled positive data can be from different distributions. We rigorously establish the theoretical foundations of DETM, including identifiability, parameter estimation, and asymptotic properties. Additionally, we move forward to statistical inference by developing a goodness-of-fit test for the SCAR condition and constructing confidence intervals for the proportion of positive instances in the target domain. We leverage an approximated Bayes classifier for classification tasks, demonstrating DETM's robust performance in prediction. Through theoretical insights and practical applications, this study highlights DETM as a comprehensive framework for addressing the challenges of PU data.

时间:2024-09-26 (Thursday) 16:40-18:00
地点:经济楼N302
讲座语言:中文
主办单位:太阳成集团tyc7111cc、王亚南经济研究院、邹至庄经济研究院
承办单位:
期数:
联系人信息:周梦娜:2182886,zmn1994@xmu.edu.cn
TOP