Website accessibility sampling evaluation based on KNN and local regression
CHEN Rong-hua1, WANG Ying-han2, BU Jia-jun3, YU Zhi3, GAO Fei4
1. College of Information Engineering, Jiangxi Vocational College of Finance & Economics, Jiujiang 332000, China;
2. Department of Information Engineering Shangrao Vocational & Technical College, Shangrao 334109, China;
3. Zhejiang Provincial Key Laboratory of Service Robot, College of Computer Science, Zhejiang University, Hangzhou 310027, China;
4. College of Information Engineering, Putian University, Putian 351100, China
A novel sampling evaluation algorithm was proposed for a given page based on the KNN evaluated samples. As sampling in a large website tends to be sparse, KNN may lead to a high evaluation bias and a local regression model was thus employed to improve the quality of KNN-based evaluation. First, a certain number of webpages were randomly selected from a website and evaluated to obtain an initial website accessibility score. Each evaluated webpage was treated as a labeled sample and the accessibility scores for the rest pages in the website were estimated using local regression on the KNN. The experimental results validate that the proposed algorithm has significant improvement over the random sampling algorithm in website accessibility evaluation.
Received: 08 November 2017
Published: 20 September 2018
CHEN Rong-hua, WANG Ying-han, BU Jia-jun, YU Zhi, GAO Fei. Website accessibility sampling evaluation based on KNN and local regression. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2018, 52(9): 1702-1708.
[1] 丁明华. 政府APP:移动电子政务发展模式新思路[J]. 商业经济研究, 2014(12):66-67 DING Ming-hua. Government APP:new development thoughts of mobile e-government affairs [J]. Business and Economy Study, 2014(12):66-67
[2] 陈锐浩. 移动互联网时代高校资源库建设思路及实践[J]. 深圳职业技术学院学报, 2017, 16(1):39-44 CHEN Rui-hao. Constructive thinking and practice of the university resources system in a mobile internet era [J]. Journal of Shenzhen Polytechnic, 2017, 16(1):39-44
[3] 姜卫玲, 储颖. "互联网+"与区域文化影响力:发展现状、提升策略[J]. 阅江学刊, 2017(6):117-123 JIANG Wei-ling, CHU Ying. Current situation and improving strategy of "Internet+" and regional cultural influence [J]. Yuejiang Academic Journal, 2017(6):117-123
[4] AVILA C, BALDIRIS S, FABREGAT R, et al. A web content accessibility evaluation process for learning objects in the context of a virtual learning environment[M]//New Horizons in Web Based Learning. Berlin Heidelberg: Springer, 2014:181-190.
[5] 杨飞. 论残疾人的信息无障碍权[J]. 河南财经政法大学学报, 2013, 28(2):118-124 YANG Fei. On the barrier-free right of information of the disabled [J]. Journal of Henan University of Economics and Law, 2013, 28(2):118-124
[6] 周宇. 基于抽样和模板的网站无障碍检测方法[D]. 杭州:浙江大学, 2014. ZHOU Yu. Website accessibility detection methods basic on sampling and template[D]. Hangzhou:Zhejiang University. 2014.
[7] ABOUZAHRA S. Web accessibility evaluation[M]//Web Accessibility:a Foundation for Research. 2008:79-106.
[8] 赵英, 傅沛蕾. 网页信息无障碍主观检测方法研究[J]. 图书馆学研究, 2016(9):24-33 ZHAO Ying, FU Pei-lei. Research on subjective detection method of web information accessibility [J]. Research on Library Science, 2016(9):24-33
[9] DEBNATH S, MITRA P, GILES C L. Automatic extraction of informative blocks from webpages[C]//ACM Symposium on Applied Computing. Santa Fe:ACM, 2005:1722-1726.
[10] 欧阳佳, 林丕源. 基于DBSCAN算法的网页正文提取[J]. 计算机工程, 2011, 37(3):64-66 OU Yang-jia, LIN Pi-yuan. Webpage content extraction based on DBSCAN [J]. Computer Engineering, 2011, 37(3):64-66
[11] NIETZIO A, STROBBE C, VELLEMAN E. The unified web evaluation methodology (UWEM) 1.2 for WCAG 1.0[M]//Computers Helping People with Special Needs. Berlin Heidelberg:Springer, 2008:394-401.
[12] 李美艳. 基于权重学习的网站无障碍检测度量方法研究[D]. 杭州:浙江大学, 2016. LI Mei-yan. Research on measure method of website accessibility detection based on weight learning[D]. Hangzhou:Zhejiang University. 2016.
[13] 陈静. 面向网页信息无障碍合规性检测的样例推荐系统[D]. 杭州:浙江大学, 2017. CHEN Jing, The recommendation system of sample pages for web accessibility evaluation[D]. Hangzhou:Zhejiang University. 2017.
[14] Monika R. Henzinger, Allan Heydon. Michael mitzenmacher on near-uniform URL sampling[J]. Computer Networks, 2000, 33(1):295-308.
[15] ZHANG M N, WANG C, BU J J, et al. An optimal sampling method for web accessibility quantitative metric[J]. Internet Research, 2015(7):1-4.
[16] ZHANG M N, WANG C, BU J J, et al. A sampling method based on URL clustering for fast web accessibility evaluation[J]. Frontiers of Information Technology & Electronic Engineering, 2015, 16(6):449-456.
[17] W3C/WAI. Conformance evaluation of web sites for accessibility:determine the scope of the evaluation[EB/OL]. (2016-10-24)[2017-09-20]. http://www.w3.org/WAI/eval/conformance.html#scope.
[18] NIETZIO A, STROBBE C, VELLEMAN E. The unified web evaluation methodology (UWEM) 1.2 for WCAG 1.0[M]. Berlin Heidelberg:Springer, 2008:394-401.
[19] 丁姝洁. 基于检测点分布抽样的网站无障碍检测[D]. 浙江大学, 2015. DING Shu-jie. Web accessibility evaluation based on checkpoint sampling[D]. Hangzhou:Zhejiang University, 2015.
[20] iris数据的KNN实现[EB/OL]. (2016-11-10).[2017-11-08]. https://blog.csdn.net/chenchunyue11/article/details/51416117.
WEI Xiao-feng, CHENG Cheng-qi, CHEN Bo, WANG Hai-yan. Chain code based on independent edge number[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2018, 52(9): 1686-1693.