Non-linear relationships between the built environment and walking to school: Applying extreme gradient boosting method
Received date: 2021-02-07
Request revised date: 2021-05-25
Online published: 2022-04-28
Supported by
Major Program of the National Social Science Fund of China(20ZDA036)
Key Program of the Center on Child Protection and Development (Sichuan)(ETBH2021-ZD001)
Copyright
Walking is not only a primitive and convenient transport mode but also an important integrant of physical activity, which is beneficial for the promotion of public health, alleviation of traffic congestion, and mitigation of transportation-induced pollution. In modern China, cities are expanding rapidly, people are enjoying a dramatic improvement in living standards, and the pace of life is accelerating. As a result, urban people, including adolescents, tend to travel in motorized modes increasingly more and walk less. The prevalence of physical inactivity among adolescents has brought about a series of health issues, such as deterioration of physical fitness, obesity, and some non-communicable diseases (for example, diabetes and hypertension). Travel to school is among the most important routine travels for adolescents. Promoting adolescents' propensity of walking to school can effectively help them integrate physical activity into daily life and thus enhance their overall physical activity level. Hence, scholars from diverse disciplines (for example, geography, urban planning, and public health) have been drawn to examine the relationships between the built environment and walking to school. However, the current research is insufficient in the following two aspects. First, the existing research is mainly based on the Western context, whereas few studies have been conducted in China. Second, the majority of existing studies assumed a linear or generalized linear (for example, log-linear) relationship between the built environment and walking to school, and no studies, to the best of our knowledge, have examined the non-linear relationships between them. Therefore, this study, taking Xiamen, China as the case and employing its large-scale travel behavior survey dataset in 2015, explored the non-linear effects of the built environment on adolescents' propensity of walking to school. We applied a state-of-the-art machine learning method, namely extreme gradient boosting method (XGBoost), to fit the model, and interpreted the model with relative importance and partial dependence plots. The results show that: 1) Distance from home to school is the most important factor influencing walking to school, with the relative importance of 39.99%. 2) The built environment, which is characterized by the 5Ds (density, diversity, design, destination accessibility, and distance to transit) model, is an important contributor, and relative contributions of the built environment variables at home and school collectively contributed 36.28% of the model's explanatory power, only second to distance to school, much higher than that of sociodemographic variables (23.73%). Distance to city center and population density around both home and school contribute a great deal. 3) All the built environment variables at both ends of school trips and the key sociodemographic variables have non-linear effects on adolescents' propensity of walking to school, and there exist obvious threshold effects. This study can inform decision makers with nuanced policy insights for promoting adolescents' behavior of walking to school.
LIU Jixiang , XIAO Longzhu , ZHOU Jiangping , GUO Yuanyuan , YANG Linchuan . Non-linear relationships between the built environment and walking to school: Applying extreme gradient boosting method[J]. PROGRESS IN GEOGRAPHY, 2022 , 41(2) : 251 -263 . DOI: 10.18306/dlkxjz.2022.02.006
表1 自变量描述性统计Tab.1 Descriptive statistics of independent variables |
变量 | 定义/描述 | 均值(标准差)或百分比 |
---|---|---|
社会经济属性 | ||
年龄 | 受访者年龄(岁) | 11.09 (8.32) |
性别 | 1=女,0=男 | 0=48.0%, 1=52.0% |
住房面积 | 家庭住房面积(m2) | 79.77 (46.37) |
住房性质 | 分类变量(1=自有, 2=单位住房, 3=租住) | 1=81.3%, 2=4.5%, 3=14.2% |
家庭规模 | 家庭成员数量(人) | 4.85 (3.22) |
户口 | 1=有厦门户口,2=无厦门户口 | 0=9.8%, 1=90.2% |
拥有小汽车 | 1=所在家庭拥有至少1辆小汽车,0=无 | 0=83.7%, 1=16.3% |
父/母以步行通勤 | 1=父母亲至少一人以步行为通勤方式,0=无 | 0=81.17%, 1=18.83% |
有人接送 | 1=有长辈接送,0=无 | 0=70.11%, 1=29.89% |
出行特征 | ||
通学距离 | 受访者通学实际距离(km) | 1.24 (1.68) |
建成环境 | ||
人口密度 | 人口数量/TAZ面积(人/km2) | 13194.87 (10325.10) |
容积率 | 总建筑面积/TAZ面积 | 0.88 (0.56) |
土地利用混合度 | 14种主要的土地利用类型(包括居住、工业、教育、行政办公、休闲娱乐、公共开放空间等)的数量及比例,采用改良版的熵值法计算 | 0.58 (0.16) |
交叉口密度 | 道路交叉口数量/ TAZ面积(个/km2) | 82.88 (64.35) |
公交密度 | 公交线路数量/ TAZ面积(条/km2) | 69.44 (62.78) |
离市中心距离 | 从TAZ中心到CBD(即中山路)的路网距离(km) | 8.78 (4.07) |
表2 模型表现Tab.2 Model performance |
模型 | 操作平台 | 最佳迭代数 | 训练误差 | 测试误差 | 预测准确率/% | AUC值 |
---|---|---|---|---|---|---|
XGBoost | R, “XGBoost” | 9931 | 0.1258 | 0.1713 | 82.88 | 0.892 |
GBDT | R, “caret” | 6554 | 0.2188 | 0.2149 | 78.51 | 0.816 |
RF | R, “randomForest” | 500 | 0.1561 | 0.1789 | 82.11 | 0.848 |
LightGBM | Python, “lightGBM” | 19821 | 0.1610 | 0.1937 | 80.63 | 0.803 |
Adaboost | R, “adabag” | 500 | 0.1686 | 0.1923 | 80.77 | 0.798 |
Logistic | R, “glm” | — | 0.2243 | 0.2396 | 76.04 | 0.743 |
表3 自变量相对重要性Tab.3 Relative importance of independent variables |
变量 | 相对重要性/% | 排名 |
---|---|---|
出行特征 | ||
通学距离 | 39.99 | 1 |
社会经济属性 | ||
住房面积 | 7.84 | 2 |
年龄 | 7.13 | 3 |
有人接送 | 2.52 | 15 |
家庭规模 | 1.42 | 17 |
性别 | 1.14 | 18 |
拥有小汽车 | 1.13 | 19 |
父/母以步行通勤 | 0.97 | 20 |
户口 | 0.81 | 21 |
住房性质 | 0.38 | 22 |
小计 | 23.73 | |
出发地(家)建成环境 | ||
土地利用混合度 | 3.67 | 4 |
离市中心距离 | 3.59 | 5 |
道路交叉口密度 | 3.50 | 6 |
容积率 | 3.28 | 8 |
公交站点密度 | 2.90 | 9 |
人口密度 | 2.88 | 10 |
小计 | 19.82 | |
目的地(校)建成环境 | ||
道路交叉口密度 | 3.45 | 7 |
离市中心距离 | 2.82 | 11 |
公交站点密度 | 2.70 | 12 |
容积率 | 2.69 | 13 |
土地利用混合度 | 2.56 | 14 |
人口密度 | 2.24 | 16 |
小计 | 16.46 |
[1] |
|
[2] |
姜玉培, 甄峰, 王文文, 等. 城市建成环境对居民身体活动的影响研究进展与启示[J]. 地理科学进展, 2019, 38(3):357-369.
[
|
[3] |
|
[4] |
王依茹, 王琛, 曾金迪. 个体与环境交互作用下中国成人超重肥胖情况变化趋势及影响因素研究[J]. 地理科学进展, 2020, 39(1):100-110.
[
|
[5] |
|
[6] |
|
[7] |
梁璇. 中日青少年体质健康数字“差”的背后 [N/OL]. 中国青年报, 2017-06-26(8) [2021-01-01]. http://zqb.cyol.com/html/2017-06/26/nw.D110000zgqnb_20170626_1-08.htm.
[
|
[8] |
王敬东. 中国儿童肥胖筛查共识发布, 明确儿童体重超标与严重超标标准 [EB/OL]. 央视网, 2019-09-23 [2021-01-01]. http://news.cctv.com/2019/09/22/ARTIYm98cMCfYj5bNPmK2K0A190922.shtml.
[
|
[9] |
马淑婧, 羊柳, 赵敏, 等. 1991—2015年中国儿童青少年血压水平及高血压检出率的变化趋势[J]. 中华流行病学杂志, 2020, 41(2):178-183.
[
|
[10] |
刘吉祥, 周江评, 肖龙珠, 等. 建成环境对步行通勤通学的影响: 以中国香港为例[J]. 地理科学进展, 2019, 38(6):807-817.
[
|
[11] |
|
[12] |
|
[13] |
|
[14] |
|
[15] |
|
[16] |
|
[17] |
Committee on Environmental Health,
|
[18] |
|
[19] |
|
[20] |
|
[21] |
|
[22] |
|
[23] |
|
[24] |
|
[25] |
|
[26] |
|
[27] |
|
[28] |
|
[29] |
|
[30] |
|
[31] |
|
[32] |
|
[33] |
|
[34] |
|
[35] |
|
[36] |
|
[37] |
自然资源保护协会, 清华大学建筑学院. 中国城市步行友好性评价: 基于街道功能促进步行的研究[R]. 北京, 2017.
[Natural Resources Defense Council, School of Architecture of Tsinghua University. Appraisal of walking friendliness of Chinese cities. Beijing, China. 2017. ]
|
[38] |
|
[39] |
|
[40] |
|
[41] |
|
[42] |
|
[43] |
厦门市统计局. 2020厦门经济特区年鉴[M]. 厦门: 厦门市统计局, 2020.
[Xiamen Burear of Statistics. Statistic yearbook of Xiamen Special Economic Zone in 2020. Xiamen, China: Xiamen Burear of Statistics, 2020.]
|
[44] |
|
/
〈 |
|
〉 |