Similar Document Retrieval among the Different Kind of National R&D Outcomes
Research and development activities produce many kinds of outcomes, such as articles, patents, research reports, human resources information, application methods for equipment, and experimental data. The NTIS (National Science & Technology Information Service) in Korea offers researchers a unified search service over national R&D outcome data. However, this service does not meet the needs of academic users who want to exploit the relevance among papers, patents, research reports, and other outcomes. Related documents should be displayed together when a user is viewing a page that shows the detailed metadata of one outcome, which reduces the effort users spend searching for information of interest. In this paper, we propose a method for similar document retrieval among heterogeneous kinds of R&D outcomes. A combination of the user query and search factors extracted from the search engine is used to retrieve similar documents, and a boosting technique based on the author field and the subject code (S&T standard code) field is applied in the document ranking process. We show that the proposed method is useful for developing an intelligent NTIS system and other metadata search services.
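The ranking idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the `Outcome` record, the Jaccard term overlap used as the base similarity, and the boost weights are all assumptions chosen for illustration, and the abstract does not say how the search engine computes its scores.

```python
from dataclasses import dataclass, field
from typing import List, Set, Tuple


@dataclass
class Outcome:
    """Hypothetical metadata record for one R&D outcome."""
    doc_id: str
    kind: str  # e.g. "paper", "patent", "report"
    terms: Set[str]
    authors: Set[str] = field(default_factory=set)
    subject_codes: Set[str] = field(default_factory=set)


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Term overlap, used here as a stand-in for the engine's relevance score."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def score(source: Outcome, candidate: Outcome, query_terms: Set[str],
          author_boost: float = 0.5, subject_boost: float = 0.3) -> float:
    """Base similarity on query + source terms, boosted by shared fields.

    The boost weights are illustrative assumptions, not values from the paper.
    """
    search_terms = source.terms | query_terms
    base = jaccard(search_terms, candidate.terms)
    boost = 1.0
    if source.authors & candidate.authors:
        boost += author_boost      # shared author raises the rank
    if source.subject_codes & candidate.subject_codes:
        boost += subject_boost     # shared subject code raises the rank
    return base * boost


def similar_documents(source: Outcome, candidates: List[Outcome],
                      query_terms: Set[str], top_k: int = 5) -> List[Tuple[str, float]]:
    """Return up to top_k (doc_id, score) pairs, best first, across all outcome kinds."""
    scored = [(score(source, c, query_terms), c)
              for c in candidates if c.doc_id != source.doc_id]
    scored = [(s, c) for s, c in scored if s > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(c.doc_id, round(s, 3)) for s, c in scored[:top_k]]
```

In this sketch the boosts multiply the base similarity, so a candidate with no term overlap is never surfaced by a shared author or subject code alone; an additive boost would behave differently.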
Keywords: Similar Document Retrieval, NTIS, Data Relevance