基于机器学习算法的青少年电子烟使用及影响因素分析

Analysis of adolescent e-cigarette use and influencing factors based on machine learning algorithms

  • 摘要: 目的:了解广西某市15岁以上青少年吸电子烟现状及影响因素,为控制电子烟在青少年中的流行提供资料参考。方法:通过多阶段分层整群随机抽样对广西某市15岁以上青少年进行问卷调查,综合运用logistic回归、随机森林、XGboost、支持向量机模型、单隐藏层神经网络、KNN模型进行影响因素分析。结果:广西某市15岁以上青少年电子烟使用率为1.68%,其中高中生、职高生电子烟使用率分别为1.08%、1.74%;不同的机器学习模型在各项评价指标的表现上各有优劣;青少年使用电子烟的9个主要影响因素包括:过去30 d是否在互联网上看到电子烟广告、朋友是否吸烟、学习压力水平、是否看到过老师吸烟、抑郁情况、性别、公共场合是否看到有人吸烟、吸烟是否使年轻人具有吸引力、是否有人给免费烟草产品。结论:广西某市15岁以上青少年电子烟使用率相对较低,可将6种机器学习模型的结果结合起来对青少年电子烟使用行为进行预测,判断使用人群的特征。

     

    Abstract: Objective: To understand the current situation of e-cigarette use and influencing factors among adolescents aged 15 and above in a certain city in Guangxi in order to provide data and references for controlling the prevalence of e-cigarettes among adolescents.Methods: A questionnaire survey was conducted among adolescents aged 15 and above in a certain city in Guangxi through multi-stage stratified cluster random sampling.Logistic regression, random forest, XGboost, support vector machine models, single hidden layer neural networks, and KNN models were applied comprehensively for the analysis of influencing factors.Results: The prevalence of e-cigarette use among adolescents aged 15 and above in a certain city in Guangxi was 1.68%, with the usage rates among high school and vocational high school students being 1.08% and 1.74%, respectively.Different machine learning models demonstrated varying levels of performance across evaluation metrics.Nine primary influencing factors were identified for adolescent e-cigarette use:exposure to e-cigarette advertisements on the internet in the past 30 days, friends'smoking habits, level of academic pressure, exposure to teachers smoking, depression status, gender, exposure to smoking in public places, perception of smoking as enhancing attractiveness among youth, and receiving free tobacco products.Conclusion: The prevalence of e-cigarette use among adolescents aged 15 and above in the city is relatively low.It is possible to combine the results of six machine learning models to predict adolescent electronic cigarette usage behavior and identify the characteristics of the user population.

     

/

返回文章
返回