北大中文核心期刊
中国科学引文数据库(CSCD)来源期刊
中国科技核心期刊
入选数学领域高质量科技期刊
Scopus
EBSCO 

运筹学学报(中英文) ›› 2026, Vol. 30 ›› Issue (2): 24-44.doi: 10.15960/j.cnki.issn.1007-6093.2026.02.002

• • 上一篇    下一篇

OWL1范数约束回归模型的快速算法

门彦超, 郦旭东   

  1. 复旦大学大数据学院, 上海 200433
  • 收稿日期:2023-03-15 发布日期:2026-06-12
  • 通讯作者: 郦旭东 E-mail:lixudong@fudan.edu.cn
  • 基金资助:
    国家自然科学基金 (Nos. 12271107, 62141407), 上海市科学技术委员会基础研究重点项目 (No. 21JC1400600)

Fast algorithm for OWL1 norm constrained regression model

MEN Yanchao, LI Xudong   

  1. School of Data Science, Fudan University, Shanghai 200433, China
  • Received:2023-03-15 Published:2026-06-12

摘要: 随着机器学习技术的发展,模型中的特征数量不断增加,如何进行有效的变量筛选成为一个非常重要的课题。为控制回归系数的错误发生率,Bogdan等(2015)提出在回归模型中加入OWL1范数正则项的SLOPE模型进行变量选择。与前者不同,本文考虑OWL1范数约束回归模型。该模型与SLOPE模型相似,采用多重假设检验的视角进行变量选择,并通过两个参数的调节,更灵活地控制检验的错误发现率。在算法方面,本文基于对偶半光滑牛顿的邻近点算法(PPDNA)快速求解了OWL1范数约束回归模型。该算法外层使用邻近点算法而内层利用半光滑牛顿法高效求解子问题。同时,本文利用了OWL1范数球投影算子广义Jacobian的特殊结构来加速内层算法中的牛顿线性系统的求解。最后,通过在模拟数据及大规模真实数据集上与两种流行算法进行对比,本文验证了新算法的稳健性与有效性。

关键词: OWL1范数, 半光滑牛顿算法, 邻近点算法

Abstract: As machine learning algorithms continue to evolve and incorporate more features, effective variable selection has become a critical issue. To control the false discovery rate of regression coefficients in the variable selection, Bogdan et al. proposed the SLOPE model in 2015, which considers a linear regression model regularized by the OWL1 norm. In this paper, we consider a new regression model that uses an OWL1 norm constraint, enabling more flexible control of the false discovery rate through two parameters, $\lambda$ and $\tau$. We designed a fast algorithm that efficiently solves this model by using the dual semismooth Newton based proximal point algorithm (PPDNA). The algorithm's outer layer uses the proximal point algorithm, while the inner layer employs the semismooth Newton method to solve the subproblems efficiently. Meanwhile, the special structure of the generalized Jacobian induced by the projector onto the OWL1 norm ball is exploited to efficiently handle the corresponding Newton linear systems in the inner algorithm. Finally, we demonstrate the effectiveness and robustness of PPDNA by comparing it with two popular algorithms using both simulated data and large-scale real datasets.

Key words: OWL1 norm, semismooth Newton method, proximal point method

中图分类号: