Query analysis for search engine in E-commerce Web sites
Abstract: On most e-commerce web sites, keyword search is an important way for users to find products, and computing the weights of the terms in a query string is a major step in the search engine's query processing. This paper summarizes the shortcomings of existing methods for computing term weights and proposes a new method that combines the importance and the relevance of terms when determining their weights. The method effectively improves the accuracy of term weight determination and is practical and reliable.
Key words:
- information retrieval
- search engine
- term weight