Author's Guide

MORE>>
  • 2018 No.11
  • Published:25 November 2018
  • Directed by: Chiness Academy of Sciences
  • Sponsored by: National Science Library, Chinese Academy of Sciences
  • Published by: Editorial Board of New
    Data Analysis and Knowledge Discovery
      25 November 2018, Volume 2 Issue 11 Previous Issue   
    For Selected: View Abstracts Toggle Thumbnails
    Predicting Conversion Rate of APP Advertising with Machine Learning
    Yang Zhao,Xini Yuan,Yawen Chen,Liqiang Wu
    2018, 2 (11): 2-9.  DOI: 10.11925/infotech.2096-3467.2018.0834
    Abstract   HTML   PDF (741KB) ( 5 )

    [Objective] This paper tries to predict the conversion rate of APP advertisements with the help of machine learning algorithms, aiming to improve the effectiveness of advertising and marketing activities. [Methods] First, we examined the characteristics of APP advertisements. Then, we applied four machine learning algorithms to predict their conversion rate. The proposed RF+LXFV model was built with Random Forest, Gradient Boosting Decision Tree, Random Forest, LightGBM, XGBoost, Vowpal Wabbit and Field-aware Factorization Machine. Finally, we evaluated the validity and accuracy of the new model with Tencent APP advertising data. [Results] The prediction results of the proposed model achieved higher accuracy than those of the single algorithm. [Limitations] We did not examine the impacts of advertising transformation delay on prediction. [Conclusions] The proposed RF+LXFV model could predict the conversion rate of APP advertising effectively.

    Figures and Tables | References | Related Articles | Metrics
    Predicting Repeat Purchase Intention of New Consumers
    Liyi Zhang,Yiran Li,Xuan Wen
    2018, 2 (11): 10-18.  DOI: 10.11925/infotech.2096-3467.2018.0823
    Abstract   HTML   PDF (944KB) ( 6 )

    [Objective] This paper compares the prediction accuracy and efficiency of different machine learning algorithms, aiming to identify new consumers with repeat purchase intentions. It also provides a theoretical framework for customer classification. [Methods] First, we collected the server logs of a dealer on Taobao.com from 2015 to 2018, as well as its orders and consumers’ personal information. And then, we used different algorithms to train the proposed models. [Results] The SMOTE algorithm combined with the random forest algorithm obtained the highest prediction accuracy of 96%. [Limitations] The sample data size needs to be expanded. [Conclusions] The fusion algorithm based on SMOTE and random forest has better performance in predicting repurchase intentions of new consumers.

    Figures and Tables | References | Related Articles | Metrics
    Examining Consumer Reviews of Overseas Shopping APP with Sentiment Analysis
    Yang Zhao,Qiqi Li,Yuhan Chen,Wenhang Cao
    2018, 2 (11): 19-27.  DOI: 10.11925/infotech.2096-3467.2018.0835
    Abstract   HTML   PDF (1546KB) ( 5 )

    [Objective] This paper analyzes the sentiment of online reviews, and then evaluates the consumer’s satisfaction with overseas shopping APP, aiming to improve its performance. [Methods] First, we collected reviews of these APPs from the APP Store. Then, we clustered the APPs’ attributes with Canopy and K-means algorithms, which defines the evaluation dimensions of consumer’s satisfaction. Finally, we computed scores of the consumer’s satisfaction with the CNN-SVM sentiment analysis model. [Results] The most important factor affecting the consumer’s satisfaction with overseas shopping APP was commodities, followed by price, interaction, service, and logistics. The consumer’s satisfaction level with the vertical overseas shopping APP was higher than that of the overseas buyer APP and the comprehensive overseas shopping APP. The consumer’s satisfaction level is relatively low with logistics and services. [Limitations] More overseas shopping APP should be included in future research. [Conclusions] The sentiment analysis method is an effective way to analyze consumer’s satisfaction with online reviews of overseas shopping APP.

    Figures and Tables | References | Related Articles | Metrics
    Recommendation Algorithm for Post-Context Filtering Based on TF-IDF: Case Study of Catering O2O
    Cong Yin,Liyi Zhang
    2018, 2 (11): 28-36.  DOI: 10.11925/infotech.2096-3467.2018.0832
    Abstract   HTML   PDF (821KB) ( 4 )

    [Objective] This paper carries out an in-depth study on context-integrated and personalized recommendation, aiming to address the issue of information overload. [Methods] We proposed a new contextual preference prediction model based on TF-IDF algorithm for post-context filtering, as well as the contextual association probability and universal importance. Then, we adjusted the initial scores of traditional recommendation with the help of item category preferences to generate the final list. [Results] We conducted an empirical study on catering industry and found that the proposed algorithm yielded better results. [Limitations] The accuracy of the context association needs to be improved. [Conclusions] Context information plays an important role in user behavior and decision making. More research is needed to improve the personalized recommendation based on context modeling.

    Figures and Tables | References | Related Articles | Metrics
    Evaluating and Optimizing Supply Chains with LMBP Algorithm
    Hu Meng,Xiaobei Liang,Yixiong Yang,Min Li
    2018, 2 (11): 37-45.  DOI: 10.11925/infotech.2096-3467.2018.0833
    Abstract   HTML   PDF (625KB) ( 0 )

    [Objective] This paper uses the LMBP algorithm of feedback neural network to evaluate and optimize the supply chains, aiming to improve the decision-making of enterprises. [Methods] First, we built an evaluation model for supply chains. Then, we generated 21 indicators for corporate performance based on this model. Third, we used the MATLAB to evaluate this algorithm. [Results] The proposed method helped enterprises obtain the results of performance analysis in time, and then improved the management of procurement, inventory, and sales. It reduced the operation costs of enterprises, and improved the decision making process. [Limitations] The new method should be examined with more cases. [Conclusions] The proposed method could improve the performance of supply chains.

    Figures and Tables | References | Related Articles | Metrics
    Impacts of Landlords on Tenants of Short-term Rentals
    Xiaobei Liang,Zhen Xu,Jingjing Li
    2018, 2 (11): 46-53.  DOI: 10.11925/infotech.2096-3467.2018.0836
    Abstract   HTML   PDF (517KB) ( 1 )

    [Objective] This study explores the influences of room owners on their tenants’ Electronic Word of Mouth(eWOM). [Methods] First, we retrieved data from Airbnb with the help of a Web crawler. Then, we proposed a Poisson Regression model based on the signal theory. Finally, we studied the impacts of room owners’ service on consumers’ eWOM. [Results] The eWOM of the available rooms was positively correlated with features introduction, after-sales interaction, instant reservation, calendar update, response time, high-quality service and identity certification. [Limitations] More samples from regions outside of Beijing should be included. [Conclusions] The proposed model could improve the service of short-term rentals.

    Figures and Tables | References | Related Articles | Metrics
    Detecting Relationship Among WeChat Group Members with Co-occurrence of Cooperation
    Gang Li,Xiao Wang,Yang Guo
    2018, 2 (11): 54-63.  DOI: 10.11925/infotech.2096-3467.2018.0320
    Abstract   HTML   PDF (1546KB) ( 0 )

    [Objective] This paper analyzes the implicit relationship among WeChat group members and meaures its strength, which is also combined with their explicit relatinship to describe the social network characteristics of WeChat groups. [Methods] First, we collected chatting records from one WeChat interest group. Then, we used the co-occurrence to measure the implicit relationship and the salton index to calculate their strength. Third, we analyzed the discussion participation to explore the implicit-relationship distribution. Finally, we compared the full-relationship network with explicit-relationship network. [Results] We found that topic discussion clearly reflected relationship among group members. Posting more relevant topics helps to manage and maintain membership. [Limitations] More research is needed to measure goup members’ engagement. [Conclusions] The full-network with implicit and explicit relationship reveals more insights on the structure of WeChat group.

    Figures and Tables | References | Related Articles | Metrics
    研究论文
    Analyzing Scientific Literature with Content Similarity - Topics over Time Model
    Weilin He,Guohe Feng,Hongling Xie
    2018, 2 (11): 64-72.  DOI: 10.11925/infotech.2096-3467.2018.0292
    Abstract   HTML   PDF (1106KB) ( 2 )

    [Objective] This paper studies the topics of scientific literature and then tracks their changes.[Methods] We used the improved CSToT Model (Content Similarity - Topics over Time), to analyze scholarly papers from 9 information science journals in China published from 2012-2016. [Results] The CSToT model effectively revealed the subject structure of scientific literature and the evolution of topics. We also found that majority of the current information science research covers information services, online public opinion and data mining. Their evolution trends include rising, falling, stable and fluctuating patterns, which are particularly prominent in information services research. [Limitations] The training data set needs to be expanded. [Conclusions] The CSToT model could effectively identify the topics of scientific literature and their evolutionary trends, which provide new directions for future research.

    Figures and Tables | References | Related Articles | Metrics
    Predicting Transactions Among Agents in Patent Transfer Weighted Networks for New Energy
    Yuying Wu,Ping Sun,Xijun He,Guorui Jiang
    2018, 2 (11): 73-79.  DOI: 10.11925/infotech.2096-3467.2018.0254
    Abstract   HTML   PDF (540KB) ( 0 )

    [Objective] This paper examines the structure of weighted network for patent transfers as well as the characteristics of agents, aiming to predict transaction opportunities and promote the connection of technology supply and demand. [Methods] First, we constructed a weighted network for patented technology transactions based on data from 2012 to 2016. Then, we used the entropy method to combine its structure and contents. Finally, we used the BP neural network to predict transaction opportunities and weights. [Results] The prediction accuracy by the proposed method, which combined the structure index RA and the content index Cosine, was the highest. The prediction error was also reduced by using the real and structure weights of the network to predict the link weight. [Limitations] More research is needed to study the Node properties and network evolution mechanism. [Conclusions] The link prediction method has a higher precision, which help us find potential supply and demand agents of the technology patent transfers.

    Figures and Tables | References | Related Articles | Metrics
    Studying Knowledge Dissemination of Online Q&A Community with Social Network Analysis
    Zhongyi Wang,Heming Zhang,Jing Huang,Chunya Li
    2018, 2 (11): 80-94.  DOI: 10.11925/infotech.2096-3467.2018.0293
    Abstract   HTML   PDF (3421KB) ( 11 )

    [Objective]This paper analyzes the social network structure and knowledge dissemination mechanism of an online Q&A community, aiming to reveal the role of network nodes, and improve the learning efficiency. [Methods] First, we used the social network analysis and the entropy weight methods to describe the opinion leader’s knowledge and influence. Then, we built a knowledge dissemination model based on the Cowan model for the Q&A community. Finally, we examined the internal knowledge learning results of the network through system simulation. [Results] Ⅰ. The nodes with less knowledge had higher learning efficiency in the target network; Ⅱ. The knowledge volumes of some nodes increased rapidly, while those of the nodes with larger knowledge stock increased slowly; Ⅲ. The knowledge dissemination rate of this network has been decreasing; Ⅳ. There is strong correlation between knowledge increase and the index of knowledge and communication abilities. [Limitations] The dynamic random reconnection of network was not examined in this paper. [Conclusions] This paper offers practical advice to improve users’ learning experience in the online Q&A community.

    Figures and Tables | References | Related Articles | Metrics
    Choosing Stopwords for Patent Topic Analysis Based on Auxiliary Set
    Yan Yu,Naixuan Zhao
    2018, 2 (11): 95-103.  DOI: 10.11925/infotech.2096-3467.2018.0240
    Abstract   HTML   PDF (591KB) ( 4 )

    [Objective] This paper proposes a new method to automatically choose domain specific stopwords, aiming to improve the performance of patent topic analysis. [Methods] First, we introduced an auxiliary set and proposed two indexes of document frequency and entropies among categories based on this auxiliary set. Then, we measured the distribution of words from the auxiliary set to choose the domain specific stopwords automatically. [Results] The proposed method improved the quality of identified patent topics. [Limitations] The types and members of the auxiliary set need to be further studied. [Conclusions] The proposed stopwords selection methods could measure the characteristics of words, which helps us find the domain specific stopwords for patent analysis more effectively.

    Figures and Tables | References | Related Articles | Metrics
Investigating the Evolution Path and Hot Topics of Citizen Science Studies Abroad
Zhang Xuanhui, Zhao Yuxiang
DOI: 10.11925/infotech.2096-3467.2017.0519
2018
Vol.2
No.10 
2018-10-25
No.9
2018-09-25
No.8
2018-08-25
No.7
2018-07-25
No.6
2018-06-25
No.5
2018-05-25
No.4
2018-04-25
No.3
2018-03-25
No.2
2018-02-25
No.1
2018-01-25
2017
Vol.1
No.12 
2017-12-25
No.11
2017-11-25
No.10
2017-10-25
No.9
2017-09-25
No.8
2017-08-25
No.7
2017-07-25
No.6
2017-06-25
No.5
2017-05-25
No.4
2017-04-25
No.3
2017-03-25
No.2
2017-02-25
No.1
2017-01-25
2016
Vol.32
No.12 
2016-12-25
No.11
2016-11-25
No.10
2016-10-25
No.9
2016-09-25
No.7-8
2016-08-25
No.6
2016-06-25
No.5
2016-05-25
No.4
2016-04-25
No.3
2016-03-25
No.2
2016-02-25
No.1
2016-01-25
2015
Vol.31
No.12 
2015-12-25
No.11
2015-11-25
No.10
2015-10-25
No.9
2015-09-25
No.7-8
2015-08-25
No.6
2015-06-25
No.5
2015-05-25
No.4
2015-04-25
No.3
2015-03-25
No.2
2015-02-25
No.1
2015-01-25
2014
Vol.30
No.12 
2014-12-25
No.11
2014-11-25
No.10
2014-10-25
No.9
2014-09-25
No.7
2014-08-25
No.6
2014-06-25
No.5
2014-05-25
No.4
2014-04-25
No.3
2014-03-25
No.2
2014-02-25
No.1
2014-01-25
2013
Vol.29
No.12 
2013-12-25
No.11
2013-11-25
No.10
2013-10-25
No.9
2013-09-25
No.7
2013-08-25
No.6
2013-06-25
No.5
2013-05-25
No.4
2013-04-25
No.3
2013-03-25
No.2
2013-02-25
No.1
2013-01-25
2012
Vol.28
No.12 
2012-12-25
No.11
2012-11-25
No.10
2012-10-25
No.9
2012-09-25
No.7
2012-08-25
No.6
2012-06-25
No.5
2012-05-25
No.4
2012-04-25
No.3
2012-03-25
No.2
2012-02-25
No.1
2012-01-25
2011
Vol.27
No.12 
2011-12-25
No.11
2011-11-25
No.10
2011-10-25
No.9
2011-09-25
No.7
2011-08-25
No.6
2011-06-25
No.5
2011-05-25
No.4
2011-04-25
No.3
2011-03-25
No.2
2011-02-25
No.1
2011-01-25
Manuscript Center
  • Position
      Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn