中国综合性科技类核心期刊(北大核心)

中国科学引文数据库来源期刊(CSCD)

美国《化学文摘》(CA)收录

美国《数学评论》(MR)收录

俄罗斯《文摘杂志》收录

Message Board

Respected readers, authors and reviewers, you can add comments to this page on any questions about the contribution, review, editing and publication of this journal. We will give you an answer as soon as possible. Thank you for your support!

Name
E-mail
Phone
Title
Content
Verification Code
Issue 3
May  2018
Turn off MathJax
Article Contents
KUANG Jun, TANG Wei-hong, CHEN Lei-hui, CHEN Hui, ZENG Wei, DONG Qi-min, GAO Ming. Algorithm for video click-through rate prediction[J]. Journal of East China Normal University (Natural Sciences), 2018, (3): 77-87. doi: 10.3969/j.issn.1000-5641.2018.03.009
Citation: KUANG Jun, TANG Wei-hong, CHEN Lei-hui, CHEN Hui, ZENG Wei, DONG Qi-min, GAO Ming. Algorithm for video click-through rate prediction[J]. Journal of East China Normal University (Natural Sciences), 2018, (3): 77-87. doi: 10.3969/j.issn.1000-5641.2018.03.009

Algorithm for video click-through rate prediction

doi: 10.3969/j.issn.1000-5641.2018.03.009
  • Received Date: 2017-05-19
  • Publish Date: 2018-05-25
  • Click-through rate prediction has played an important role in video recommendation systems. A video recommendation system can suggest media to users based on the results of click-through rate prediction. In this way, users may be more likely to click the videos recommended by platforms. However, given the volume and imbalance of data in some applications, the accuracy of click-through rate prediction may be very low. To improve the performance, this paper proposes an integrated approach by combining feature engineering with techniques from machine learning. In the first stage, the algorithm uses feature engineering to extract user, video, and combinational features from the original dataset. In the second stage, the algorithm predicts the click-through rate by employing supervised models of logistic regression, factorization machine, and gradient boosting decision tree combined with logistic regression. The experimental results illustrate that the prediction accuracy of the factorization machine model and the gradient boosting decision tree combined with logistic regression model are better than the logistic regression model. Moreover, the cross combination of user and video features can improve the accuracy of the click-through rate prediction.
  • loading
  • [1]
    RENDLE S. Factorization machines[C]//IEEE International Conference on Data Mining. IEEE Computer Society, 2010: 995-1000.
    [2]
    FRIEDMAN J H. Greedy function approximation:A gradient boosting machine[J]. Annals of Statistics, 2001, 29(5):1189-1232. http://projecteuclid.org/Dienst/getRecord?id=euclid.aos/1013203451/
    [3]
    HE X, PAN J, JIN O, et al. Practical lessons from predicting clicks on ads at Facebook[C]//Proceedings of the 8th International Workshop on Data Mining for Online Advertising. ACM, 2014: 1-9.
    [4]
    纪文迪, 王晓玲, 周傲英.广告点击率估算技术综述[J].华东师范大学学报(自然科学版), 2013(3):1-14.
    [5]
    RICHARDSON M, DOMINOWSKA E, RAGNO R. Predicting clicks: Estimating the click-through rate for new ads[C]//International Conference on World Wide Web. ACM, 2007: 521-530.
    [6]
    CHAPELLE O, ZHANG Y. A dynamic bayesian network click model for web search ranking[C]//International Conference on World Wide Web. ACM, 2009: 1-10.
    [7]
    GRAEPEL T, CANDELA J Q, BORCHERT T, et al. Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing Search engine[C]//International Conference on Machine Learning. DBLP, 2010: 13-20.
    [8]
    JOACHIMS T. Optimizing search engines using click-through data[C]//Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2002: 133-142.
    [9]
    SHAN L, LIN L, SUN C, et al. Predicting ad click-through rates via feature-based fully coupled interaction tensor factorization[J]. Electronic Commerce Research & Applications, 2016, 16(C):30-42. http://www.sciencedirect.com/science/article/pii/S1567422316000144
    [10]
    YAN L, LI W J, XUE G R, et al. Coupled group lasso for web-scale CTR prediction in display advertising[C]//International Conference on Machine Learning. 2014: 802-810.
    [11]
    AGARWAL D, LONG B, TRAUPMAN J, et al. LASER: A scalable response prediction platform for online advertising[C]//ACM International Conference on Web Search and Data Mining. ACM, 2014: 173-182.
    [12]
    AQUIAR E, NAGRECHA S, CHAWLA N V. Predicting online video engagement using clickstreams[C]//IEEE International Conference on Data Science and Advanced Analytics (DSAA). IEEE, 2015. DOI: 10.1109/DSAA.2015.7344873.
    [13]
    李思琴, 林磊, 孙承杰.基于卷积神经网络的搜索广告点击率预测[J].智能计算机与应用, 2015(5):22-25. http://www.cqvip.com/QK/94259A/201505/666610458.html
    [14]
    SCHAPIRE R E. A brief introduction to boosting[C]//16th International Joint Conference on Artificial Intelligence. [S. l. ]: Morgan Kaufmann Publishers Inc, 1999: 1401-1406.
    [15]
    QUINLAN J R. Induction on decision tree[J]. Machine Learning, 1986(1):81-106. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5454676
    [16]
    HARTIGAN J A, WONG M A. Algorithm AS 136:A k-means clustering algorithm[J]. Applied Statistics, 1979, 28(1):100-108. doi:  10.2307/2346830
    [17]
    BREIMAN L. Out-of-bag estimation[R]. Berkeley: University of California, 1996.
    [18]
    BREIMAN L. Bagging Predictors[M].[S.l.]:Kluwer Academic Publishers, 1996.
    [19]
    CHEN T, GUESTRIN C. XGBoost: A scalable tree boosting system[C]//ACM SIGKDD International Conference. ACM, 2016: 785-794.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(5)  / Tables(9)

    Article views (172) PDF downloads(314) Cited by()
    Proportional views

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return