[go: up one dir, main page]

skip to main content
article

Proposal of two-stage patent retrieval method considering the claim structure

Published: 01 June 2005 Publication History

Abstract

The importance of patents is increasing in global society. In preparing a patent application, it is essential to search for related patents that may invalidate the invention. However, it is time-consuming to identify them among the millions of patents. This article proposes a patent-retrieval method that considers a claim structure for a more accurate search for invalidity. This method uses a claim text as input; it consists of two retrieval stages. In stage 1, general text analysis and retrieval methods are applied to improve recall. In stage 2, the top N documents retrieved in stage 1 are rearranged to improve precision by applying text analysis and retrieval methods using the claim structure. Our two-stage retrieval introduces five precision-oriented analysis and retrieval methods: query-term extraction from a portion of a claim text that describes the characteristics of a claim; query term-weighting without term frequency; query term-weighting with “measurement terms”; text retrieval using only claims as a target; and calculating the relevant score by “partially” adding scores in stage 2 to those in stage 1. Evaluation results using test sets of the NTCIR4 Patent Retrieval Task show that our methods are effective, though the degree of the effectiveness varies depending on the test sets.

References

[1]
ACL 2003. Proceedings of ACL 2003 Workshop on Patent Corpus Processing. http://www.slis.tsukuba.ac.jp/~fujii/acl2003ws.html.
[2]
Bear, J. et al. 1997. Using information extraction to improve document retrieval. In Proceedings of the Sixth Text Retrieval Conference (TREC-6). 367--378.
[3]
Fujii, A. et al. 2004. Overview of patent retrieval task at NTCIR-4. In Working Notes of the Fourth NTCIR Workshop Meeting. 225--232.
[4]
Clarke, C. L. et al. 1997. Relevance ranking for one to three term queries. In Proceedings of RIAO-97, 5th International Conference “Recherche d'Information Assistee par Ordinateur” 388--400.
[5]
Itoh, H. 2004. NTCIR-4 patent retrieval experiments at RICOH. In Working Notes of the Fourth NTCIR Workshop Meeting. 246--249.
[6]
Kando, N. 2004. Overview of the Fourth NTCIR Workshop. In Working Notes of the Fourth NTCIR Workshop Meeting. i--viii.
[7]
Konishi, K. et al. 2004. Invalidity patent search system of NTT DATA. In Working Notes of the Fourth NTCIR Workshop Meeting. 250--255.
[8]
Mase, H. et al. 1996. Experimental simulation for automatic patent categorization. In Proceedings of the Conference on Advances in Production Management Systems. 377--382.
[9]
Mase, H. et al. 2004. Two-stage patent retrieval method considering claim structure. In Working Notes of the Fourth NTCIR Workshop Meeting. 256--261.
[10]
Matsumoto, Y. 2000. Morphological analysis system “Chasen.” J. Information Processing Society of Japan. 41, 11 (2000), 1208--1214 (in Japanese).
[11]
Mizuno, Y. 2002. Relevant document retrieval system. Tokugikon J. 223 (2000) (in Japanese).
[12]
NTCIR3 2002. Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering. http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings3/index.html.
[13]
NTCIR4 2004. Working Notes of the Fourth NTCIR Workshop Meeting. http://research.nii.ac.jp/ntcir-ws4/NTCIR4-WN/.
[14]
Salton, G. and Mcgill, M. J. 1983. Introduction to Modern Information Retrieval. McGraw-Hill, New York.
[15]
Sarasua, L. and Corremans, G. 2000. Cross lingual issues in patent retrieval. In Proceedings of the ACM SIGIR 2000 Workshop on Patent Retrieval. ACM, New York. http://research.nii.ac.jp/ntcir/sigir2000ws/sigirprws-sarasua.pdf.
[16]
Shinmori, A. et al. 2003. Patent claim processing for readability structure analysis and term explanation. In Proceedings of the ACL 2003 Conference. 56--65.
[17]
SIGIR 2000. Proceedings of the ACM SIGIR 2000 Workshop on Patent Retrieval. ACM, New York. http://research.nii.ac.jp/ntcir/sigir2000ws/.
[18]
Sumner, R. G., Jr. and Shaw, W. M., Jr. 1997. An investigation of relevance feedback using adaptive linear and probabilistic models. The Fifth Text Retrieval Conference (TREC-5). E. M. Voorhees and D. K. Harman (eds.).
[19]
Takano, A. et al. 2002. Development of the generic association engine for processing large corpora http://geta.ex.nii.ac.jp/pdf/itx2002.pdf. (in Japanese).
[20]
Takeuchi, H. et al. 2004. Experiments on patent retrieval at NTCIR-4 Workshop. In Working Notes of the Fourth NTCIR Workshop Meeting. 271--276.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Asian Language Information Processing
ACM Transactions on Asian Language Information Processing  Volume 4, Issue 2
June 2005
179 pages
ISSN:1530-0226
EISSN:1558-3430
DOI:10.1145/1105696
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 2005
Published in TALIP Volume 4, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Patent retrieval
  2. claim structure
  3. relevant score calculation
  4. term extraction
  5. term weighting

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)CoPatEProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557270(1104-1113)Online publication date: 17-Oct-2022
  • (2021)PQPSMobile Information Systems10.1155/2021/24977702021Online publication date: 1-Jan-2021
  • (2020)Automatically transforming full length biomedical articles into search queries for retrieving related articlesEgyptian Informatics Journal10.1016/j.eij.2020.04.004Online publication date: May-2020
  • (2020)Challenges in Patent Information RetrievalEvaluating Information Retrieval and Access Tasks10.1007/978-981-15-5554-1_4(49-69)Online publication date: 2-Sep-2020
  • (2017)Claim-based patent indicators: A novel approach to analyze patent content and monitor technological advancesWorld Patent Information10.1016/j.wpi.2017.08.00850(64-72)Online publication date: Sep-2017
  • (2017)Retrieval Models Versus RetrievabilityCurrent Challenges in Patent Information Retrieval10.1007/978-3-662-53817-3_7(185-212)Online publication date: 26-Mar-2017
  • (2016)An Improved Retrievability-Based Cluster-Resampling Approach for Pseudo Relevance FeedbackComputers10.3390/computers50400295:4(29)Online publication date: 15-Nov-2016
  • (2016)Finding similar patents through semantic expansion2016 International Conference on Computer Communication and Informatics (ICCCI)10.1109/ICCCI.2016.7479982(1-5)Online publication date: Jan-2016
  • (2016)Keyword Based Search and its Limitations in the Patent Document to Secure the Idea from its InfringementProcedia Computer Science10.1016/j.procs.2016.02.08678:C(439-446)Online publication date: 1-Mar-2016
  • (2015)Improving Patent Search by Search Result DiversificationProceedings of the 2015 International Conference on The Theory of Information Retrieval10.1145/2808194.2809455(201-210)Online publication date: 27-Sep-2015
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media