Author's Guide

MORE>>
  • 2018 No.6
  • Published:25 June 2018
  • Directed by: Chiness Academy of Sciences
  • Sponsored by: National Science Library, Chinese Academy of Sciences
  • Published by: Editorial Board of New
    Data Analysis and Knowledge Discovery
      25 June 2018, Volume 2 Issue 6 Previous Issue   
    For Selected: View Abstracts Toggle Thumbnails
    Impacts of “Poster-Follower” Sentiment on Stock Market Performance
    Ning Zhang,Lemin Yin,Lifeng He
    2018, 2 (6): 1-12.  DOI: 10.11925/infotech.2096-3467.2017.1174
    Abstract   HTML   PDF (680KB) ( 0 )

    [Objective] The paper investigates the relationship between the “Bullish Sentiment Index” (BSI) of online reviews/following comments and the performance of stock market. [Methods] First, we conducted sentiment classification for comments on Shanghai Stock Exchange Composite Index using semantic analysis method. Then, we built the sentiment tendencies of these reviews and constructed their “Poster-Follower” BSI. Finally, we used linear and nonlinear models to examine the proposed method empirically. [Results] The BSI based on our proposed method (text mining) could effectively predict the stock market trend, especially on its returns. [Limitations] We only consider two emotional polarities and more research is needed to enhance the sentimental strength. [Conclusions] The Bullish Sentiment Index could effectively predict the overall stock market trend by measuring investors’ sentiment.

    Figures and Tables | References | Related Articles | Metrics
    Incentive Investments on Information Security for Libraries: An Evolutionary Game-theory Approach
    Guang Zhu,Mining Feng,Weiwei Zhang
    2018, 2 (6): 13-24.  DOI: 10.11925/infotech.2096-3467.2017.1101
    Abstract   HTML   PDF (4800KB) ( 1 )

    [Objective] This paper analyzes the library’s investment on information security from the benefit and cost perspectives, aiming to improve the effectiveness and efficiency of library security management. [Methods] First, we used the evolutionary game theory to define two players: library and technical enterprise. Then, we explored the intentions of investments on information security. Third, we analyzed the benefits and costs of investments, the payoff matrices and evolutionarily stable strategies (ESS). Finally, we designed an incentive mechanism to enhance the investment on information security. [Results] The investments from libraries and enterprises were correlated with benefit growth and cost reduction. If the benefit growth was small, the game players are less likely to invest. Once the profit growth became big, the game players tend to invest and then generated different strategies. [Limitations] We did not design the nonlinear profit function. Other factors, such as user’s demands and advertisement effects should also be included. [Conclusions] This study promotes the development of information security management in library.

    Figures and Tables | References | Supplementary Material | Related Articles | Metrics
    Identifying Competitive Intelligence Based on Knowledge Element
    Lin Sun,Yanzhang Wang
    2018, 2 (6): 25-36.  DOI: 10.11925/infotech.2096-3467.2017.0996
    Abstract   HTML   PDF (6646KB) ( 0 )

    [Objective] This study tries to identify competitive intelligence based on implicit correlated knowledge, aiming to help enterprises have upper hands in the fierce competition. [Methods] First, we constructed a knowledge system for competitive intelligence based on the metadata. Then we generated a network with the help of relationship among the attributes of these metadata. Finally we identifed competitive intelligencey through similarity analysis and merging multi-attributes. [Results] We successfully established a network for the properties of knowledge metadata from the enterprise’s financial and sales index, R&D ability and other resources. We identified the business ties based on the intelligence metadata of product HS, and merged the metadata of MGIS market planning. [Limitations] The proposed system could be improved with larger sample size. [Conclusions] This study solves the issues facing complex relation identification and intelligence analysis demands. It also benefits the competitive advantage evaluation, crisis warning, and decision making.

    Figures and Tables | References | Related Articles | Metrics
    Analyzing Public Opinion from Microblog with Topic Clustering and Sentiment Intensity
    Xiufang Wang,Shu Sheng,Yan Lu
    2018, 2 (6): 37-47.  DOI: 10.11925/infotech.2096-3467.2017.1107
    Abstract   HTML   PDF (3060KB) ( 0 )

    [Objective] This paper builds a model to monitor the trending topics from microblogs, aiming to deal with the issues of text drifting and quantitation of sentimental polarity. [Methods] First, we proposed a public opinion analysis model based on topic clustering and sentiment intensity. Then, we used the time series regression analysis to predict the sentimental changes among the trending topics. [Results] The prediction accuracy of our model reached 88.97%, which was about 7% higher than the iLab-Edinburgh model. [Limitations] More research is needed to study the early warning mechanisms for emergency events. [Conclusions] The proposed model could improve the prediction accuracy of sentimental changes, which provides an effective way to analyze the public opinion from microblogs.

    Figures and Tables | References | Related Articles | Metrics
    Impacts and Corrections of Natural Weight on Nonlinear Sci-tech Reviews——Case Study of TOPSIS Method
    Liping Yu,Xiayun Song,Zuogong Wang
    2018, 2 (6): 48-57.  DOI: 10.11925/infotech.2096-3467.2017.1124
    Abstract   HTML   PDF (619KB) ( 0 )

    [Objective] This paper explores the implicit natural weight issues facing the scientific and technology review indexes, and then proposes a method to address them. [Methods] First, we analyzed data from the JCR2016 mathematics journals with the help of TOPSIS method, aiming to find the influence of natural weights on the nonlinear evaluation method. Then, we proposed a method increasing the dynamic maximum mean to the standardized level, aiming to eliminate the impacts. [Results] We found that the natural weights posed significant effects to the Nonlinear Evaluation methods. For the weighted method, the design weights, the natural weights and the evaluation methods all affected the actual weights. For the non-weighted method, the natural weights and the evaluation methods affected the actual weights. Eliminating the natural weights could effectively reduce the influence of the evaluation method on the actual weights, which helps the design weights play a bigger role. The distribution of index data also affected the actual weights. [Limitations] The proposed method is still an approximation algorithm, which could not yield the exactly equal means. [Conclusions] To achieve the fair review for the science and technology products, we must pay attention to the natural weights issues, which is a systematic error.

    Figures and Tables | References | Supplementary Material | Related Articles | Metrics
    The Correlation Between Altmetrics and Citations
    Pengmin Wu,Ting Chen,Xiaomei Wang
    2018, 2 (6): 58-69.  DOI: 10.11925/infotech.2096-3467.2018.0354
    Abstract   HTML   PDF (4743KB) ( 0 )

    [Objective] This paper studies the characteristics of the Altmetrics for high quality journal articles, including their correlations with citation numbers, differences in disciplines, and the contribution of sub-indicators. These Altmetrics are also compared with previous results. [Methods] We selected 68 journals from Nature Index as data sources, and used machine learning method to classify papers published by them. Then, we used Spearman correlation test to find relationship between Altmetrics and traditional citation indexes, as well as the contributions of sub-indicators in various disciplines. Finally, we evaluated the effectiveness of using Altmetrics to identify highly-cited papers, with the help of ROC curve analysis. [Results] There were significant differences in the performance of Altmetrics among disciplines. In high-quality journals, the correlation between Altmetrics and citations were enhanced, and the contributions of News, Blog, and Twitter to the Altmetrics were also increased. Altmetrics could help us identify highly cited papers. [Limitations] The data collection period is short, and the data set needs to be expanded based on the characteristics of the disciplines. [Conclusions] Compared with previous research results of full data sets, Altmetrics for high-quality journal articles are unique, and the correlation between Altmetrics and citations is enhanced.

    Figures and Tables | References | Related Articles | Metrics
    Analyzing Growth Trends and Attachment Mode of Social Blog Tags
    Guanghui Ye,Jinglan Hu,Jian Xu,Lixin Xia
    2018, 2 (6): 70-78.  DOI: 10.11925/infotech.2096-3467.2017.1311
    Abstract   HTML   PDF (680KB) ( 0 )

    [Objective] This study reveals the forming mechanism of network nodes, aiming to examine the growth trend and attachment mode of social blog tags. [Methods] Firstly, we proposed the model of tag growth with the help of statistics and network analysis. Then, we established the categories of tag links and corresponding numbers, as well as summarized the connection rules of newly added tags. Finally, we defined the indicators of degree dependency and examined the probability of tag connection following preferential attachment modes. [Results] The tag growth showed the linear growth pattern and the distribution of tags had one single peak center, the shock left side and the gentle right side, which did not meet the power-law distribution. [Limitations] We did not explain the impacts of users’ tagging behaviors on the network connections. [Conclusions] Neither the “new tag-old tag” nor the “old tag-old tag” models are not fully compliant with the preferential attachment mode.

    Figures and Tables | References | Related Articles | Metrics
    Identifying E-commerce User Types Based on Complex Network Overlapping Community
    Xiaodong Qian,Min Li
    2018, 2 (6): 79-91.  DOI: 10.11925/infotech.2096-3467.2018.0101
    Abstract   HTML   PDF (2339KB) ( 0 )

    [Objective] This paper presents an algorithm to identify composite types of e-commerce users, aiming to improve e-commerce operators’ personalized marketing services. [Methods] First, we built the node distance matrix based on the characteristics of user access sequences. Then, we modified the Jaro-Winkler distance algorithm from the perspectives of redefining matching number, editing cost and rules. Third, we used the improved algorithm to calculate the user access sequence distance matrix. Based on the distance matrix, we distinguished the central and non-central users to construct a complex network for identifying user composite types. We used the improved CNM algorithm to obtain the initial user types. With the help of fuzzy membership function for user optimization, we obtained their composite types. [Results] Compared to CONGA, the NMI of the proposed algorithm was improved by 15.60%. The algorithm was also applied to examine the real user’s online data, and its overall clustering coefficient was 10.87% higher than the CONGA. The time complexity of the new algorithm was reduced too. [Limitations] The proposed algorithm needs to set three parameters subjectively. [Conclusions] The user network conforms to the characteristics of a small-world model and has the typical morphology of a complex network. The algorithm can effectively identify the composite types of e-commerce users.

    Figures and Tables | References | Related Articles | Metrics
    Extracting Topics and Their Relationship from College Student Mentoring
    Beibei Pang,Juanqiong Gou,Wenxin Mu
    2018, 2 (6): 92-101.  DOI: 10.11925/infotech.2096-3467.2018.0066
    Abstract   HTML   PDF (6340KB) ( 0 )

    [Objective] This paper proposes a framework for small-scale knowledge acquisition and modeling, aiming to more effectively manage the College Students’ deep mentoring work. [Methods] Firstly, we used the LDA to identify topics of collected documents, as well as the phrases describing the topics. Secondly, we used the concept hierarchy analysis to get the relations among these topics. Finally, we encoded ontology of the modeling results for knowledge retrieval. [Results] This study further refined the granularity of topic knowledge on the basis of LDA modeling, which reduced the difficulty of topic modeling and describe their relationship. [Limitations] We did not examine the expanded knowledge base generated by the new depth mentoring documents. [Conclusions] The proposed framework supports the modeling and retrieval of multi granularity knowledge from deep counseling, such as identifying problems, communication methods, and guiding skills.

    Figures and Tables | References | Related Articles | Metrics
    Collaborative Filtering Algorithm Based on Gray Correlation Analysis and Time Factor
    Daoping Wang,Zhongyang Jiang,Boqing Zhang
    2018, 2 (6): 102-109.  DOI: 10.11925/infotech.2096-3467.2018.0017
    Abstract   HTML   PDF (683KB) ( 0 )

    [Objective] This paper presents a collaborative filtering algorithm based on gray correlation analysis and time factor, aiming to address the low similarity resolvability and user’s interest drifting issues of the traditional algorithms. [Methods] First, we proposed a new method to calculate user similarity based on gray relational degree. Then, we used the time weight function to improve the Pearson correlation coefficients. Third, we created a hybrid similarity calculation method and made recommendation based on the neighbors of the target user. Finally, we used the MovieLens dataset to examine the new algorithm. [Results] Compared with the traditional collaborative filtering algorithms and those considering gray correlation analysis or time factor alone, the proposed algorithm reduced the mean absolute error (MAE). [Limitations] It takes the proposed algorithm longer time to calculate the hybrid similarity. [Conclusions] The hybrid similarity method improves the accuracy of recommended items for the target users and has a very good commercial promotion prospect.

    Figures and Tables | References | Related Articles | Metrics
Investigating the Evolution Path and Hot Topics of Citizen Science Studies Abroad
Zhang Xuanhui, Zhao Yuxiang
DOI: 10.11925/infotech.2096-3467.2017.0519
2018
Vol.2
No.5 
2018-05-25
No.4
2018-04-25
No.3
2018-03-25
No.2
2018-02-25
No.1
2018-01-25
2017
Vol.1
No.12 
2017-12-25
No.11
2017-11-25
No.10
2017-10-25
No.9
2017-09-25
No.8
2017-08-25
No.7
2017-07-25
No.6
2017-06-25
No.5
2017-05-25
No.4
2017-04-25
No.3
2017-03-25
No.2
2017-02-25
No.1
2017-01-25
2016
Vol.32
No.12 
2016-12-25
No.11
2016-11-25
No.10
2016-10-25
No.9
2016-09-25
No.7-8
2016-08-25
No.6
2016-06-25
No.5
2016-05-25
No.4
2016-04-25
No.3
2016-03-25
No.2
2016-02-25
No.1
2016-01-25
2015
Vol.31
No.12 
2015-12-25
No.11
2015-11-25
No.10
2015-10-25
No.9
2015-09-25
No.7-8
2015-08-25
No.6
2015-06-25
No.5
2015-05-25
No.4
2015-04-25
No.3
2015-03-25
No.2
2015-02-25
No.1
2015-01-25
2014
Vol.30
No.12 
2014-12-25
No.11
2014-11-25
No.10
2014-10-25
No.9
2014-09-25
No.7
2014-08-25
No.6
2014-06-25
No.5
2014-05-25
No.4
2014-04-25
No.3
2014-03-25
No.2
2014-02-25
No.1
2014-01-25
2013
Vol.29
No.12 
2013-12-25
No.11
2013-11-25
No.10
2013-10-25
No.9
2013-09-25
No.7
2013-08-25
No.6
2013-06-25
No.5
2013-05-25
No.4
2013-04-25
No.3
2013-03-25
No.2
2013-02-25
No.1
2013-01-25
2012
Vol.28
No.12 
2012-12-25
No.11
2012-11-25
No.10
2012-10-25
No.9
2012-09-25
No.7
2012-08-25
No.6
2012-06-25
No.5
2012-05-25
No.4
2012-04-25
No.3
2012-03-25
No.2
2012-02-25
No.1
2012-01-25
2011
Vol.27
No.12 
2011-12-25
No.11
2011-11-25
No.10
2011-10-25
No.9
2011-09-25
No.7
2011-08-25
No.6
2011-06-25
No.5
2011-05-25
No.4
2011-04-25
No.3
2011-03-25
No.2
2011-02-25
No.1
2011-01-25
Manuscript Center
  • Position
      Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn