[go: up one dir, main page]

HK1047173A1 - 用於主題分段,段意義和段操作的方法和系統 - Google Patents

用於主題分段,段意義和段操作的方法和系統

Info

Publication number
HK1047173A1
HK1047173A1 HK02108592.3A HK02108592A HK1047173A1 HK 1047173 A1 HK1047173 A1 HK 1047173A1 HK 02108592 A HK02108592 A HK 02108592A HK 1047173 A1 HK1047173 A1 HK 1047173A1
Authority
HK
Hong Kong
Prior art keywords
segment
significance
function
segmentation
topical
Prior art date
Application number
HK02108592.3A
Other languages
English (en)
Inventor
R Mckeown Kathleen
L Klavans Judith
Kan Min-Yen
Original Assignee
Univ Columbia
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univ Columbia filed Critical Univ Columbia
Publication of HK1047173A1 publication Critical patent/HK1047173A1/zh

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
HK02108592.3A 1999-04-12 2002-11-28 用於主題分段,段意義和段操作的方法和系統 HK1047173A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/290,643 US6473730B1 (en) 1999-04-12 1999-04-12 Method and system for topical segmentation, segment significance and segment function
PCT/US2000/009733 WO2000062194A2 (en) 1999-04-12 2000-04-12 Method and system for topical segmentation, segment significance and segment function

Publications (1)

Publication Number Publication Date
HK1047173A1 true HK1047173A1 (zh) 2003-02-07

Family

ID=23116939

Family Applications (1)

Application Number Title Priority Date Filing Date
HK02108592.3A HK1047173A1 (zh) 1999-04-12 2002-11-28 用於主題分段,段意義和段操作的方法和系統

Country Status (7)

Country Link
US (1) US6473730B1 (zh)
EP (1) EP1208456A2 (zh)
AU (1) AU768495B2 (zh)
CA (1) CA2370032A1 (zh)
HK (1) HK1047173A1 (zh)
IL (2) IL145874A0 (zh)
WO (1) WO2000062194A2 (zh)

Families Citing this family (70)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7925610B2 (en) * 1999-09-22 2011-04-12 Google Inc. Determining a meaning of a knowledge item using document-based information
US8914361B2 (en) * 1999-09-22 2014-12-16 Google Inc. Methods and systems for determining a meaning of a document to match the document to content
US8051104B2 (en) * 1999-09-22 2011-11-01 Google Inc. Editing a network of interconnected concepts
AU2001251736A1 (en) * 2000-03-27 2001-10-08 Documentum, Inc Method and apparatus for generating metadata for a document
US6968332B1 (en) * 2000-05-25 2005-11-22 Microsoft Corporation Facility for highlighting documents accessed through search or browsing
AU2000269782A1 (en) * 2000-09-07 2002-03-22 Intel Corporation Method and apparatus for summarizing multiple documents using a subsumption model
US7210100B2 (en) * 2000-09-27 2007-04-24 Eizel Technologies, Inc. Configurable transformation of electronic documents
US7613810B2 (en) * 2000-09-27 2009-11-03 Nokia Inc. Segmenting electronic documents for use on a device of limited capability
US7165023B2 (en) * 2000-12-15 2007-01-16 Arizona Board Of Regents Method for mining, mapping and managing organizational knowledge from text and conversation
US7254773B2 (en) * 2000-12-29 2007-08-07 International Business Machines Corporation Automated spell analysis
US7565605B2 (en) * 2001-05-08 2009-07-21 Nokia, Inc. Reorganizing content of an electronic document
US20030093565A1 (en) * 2001-07-03 2003-05-15 Berger Adam L. System and method for converting an attachment in an e-mail for delivery to a device of limited rendering capability
US7610189B2 (en) * 2001-10-18 2009-10-27 Nuance Communications, Inc. Method and apparatus for efficient segmentation of compound words using probabilistic breakpoint traversal
AU2002365162A1 (en) * 2001-11-14 2003-07-09 Northwestern University Self-assembly and mineralization of peptide-amphiphile nanofibers
WO2003070749A2 (en) * 2002-02-15 2003-08-28 Northwestern University Self-assembly of peptide-amphiphile nanofibers under physiological conditions
US20040076930A1 (en) * 2002-02-22 2004-04-22 Steinberg Linda S. Partal assessment design system for educational testing
US20060004732A1 (en) * 2002-02-26 2006-01-05 Odom Paul S Search engine methods and systems for generating relevant search results and advertisements
US7340466B2 (en) * 2002-02-26 2008-03-04 Kang Jo Mgmt. Limited Liability Company Topic identification and use thereof in information retrieval systems
US7716207B2 (en) * 2002-02-26 2010-05-11 Odom Paul S Search engine methods and systems for displaying relevant topics
US7534761B1 (en) 2002-08-21 2009-05-19 North Western University Charged peptide-amphiphile solutions and self-assembled peptide nanofiber networks formed therefrom
KR100481580B1 (ko) * 2002-10-09 2005-04-08 한국전자통신연구원 문서에서 이벤트 문장을 추출하는 장치 및 그 방법
US7554021B2 (en) * 2002-11-12 2009-06-30 Northwestern University Composition and method for self-assembly and mineralization of peptide amphiphiles
AU2003295562A1 (en) * 2002-11-14 2004-06-15 Educational Testing Service Automated evaluation of overly repetitive word use in an essay
US7683025B2 (en) 2002-11-14 2010-03-23 Northwestern University Synthesis and self-assembly of ABC triblock bola peptide amphiphiles
US20040133560A1 (en) * 2003-01-07 2004-07-08 Simske Steven J. Methods and systems for organizing electronic documents
AU2004210853A1 (en) * 2003-02-11 2004-08-26 Northwestern University Methods and materials for nanocrystalline surface coatings and attachment of peptide amphiphile nanofibers thereon
CN101482881B (zh) * 2003-07-30 2013-12-11 Google公司 用于确定文档的含义以使文档与内容匹配的方法和系统
US7379929B2 (en) * 2003-09-03 2008-05-27 Yahoo! Inc. Automatically identifying required job criteria
WO2005056576A2 (en) * 2003-12-05 2005-06-23 Northwestern University Branched peptide amphiphiles, related epitope compounds and self assembled structures thereof
JP4851939B2 (ja) 2003-12-05 2012-01-11 ノースウエスタン ユニバーシティ 自己−集合性ペプチド両親媒性物質および増殖因子送達のための関連する方法
US7689536B1 (en) 2003-12-18 2010-03-30 Google Inc. Methods and systems for detecting and extracting information
US7970600B2 (en) * 2004-11-03 2011-06-28 Microsoft Corporation Using a first natural language parser to train a second parser
EP1669896A3 (en) * 2004-12-03 2007-03-28 Panscient Pty Ltd. A machine learning system for extracting structured records from web pages and other text sources
CN101150954A (zh) * 2005-01-21 2008-03-26 西北大学 包封细胞的方法和组合物
WO2006093928A2 (en) 2005-02-28 2006-09-08 Educational Testing Service Method of model scaling for an automated essay scoring system
EP1853917A4 (en) * 2005-03-04 2008-09-10 Univ Northwestern ANGIOGENIC HEPARIN BINDING EPITOPES, PEPTIDE AMPHIPHILES, SELF-ASSEMBLED COMPOSITIONS AND METHODS OF USE THEREOF
US20060277028A1 (en) * 2005-06-01 2006-12-07 Microsoft Corporation Training a statistical parser on noisy data by filtering
US8209335B2 (en) 2005-09-20 2012-06-26 International Business Machines Corporation Extracting informative phrases from unstructured text
WO2007064639A2 (en) * 2005-11-29 2007-06-07 Scientigo, Inc. Methods and systems for providing personalized contextual search results
US8645397B1 (en) * 2006-11-30 2014-02-04 At&T Intellectual Property Ii, L.P. Method and apparatus for propagating updates in databases
US8076295B2 (en) * 2007-04-17 2011-12-13 Nanotope, Inc. Peptide amphiphiles having improved solubility and methods of using same
US7917840B2 (en) 2007-06-05 2011-03-29 Aol Inc. Dynamic aggregation and display of contextually relevant content
US8042053B2 (en) * 2007-09-24 2011-10-18 Microsoft Corporation Method for making digital documents browseable
US8229921B2 (en) * 2008-02-25 2012-07-24 Mitsubishi Electric Research Laboratories, Inc. Method for indexing for retrieving documents using particles
US8145482B2 (en) * 2008-05-25 2012-03-27 Ezra Daya Enhancing analysis of test key phrases from acoustic sources with key phrase training models
US8984398B2 (en) * 2008-08-28 2015-03-17 Yahoo! Inc. Generation of search result abstracts
KR101023209B1 (ko) * 2008-10-13 2011-03-18 한국전자통신연구원 문서 번역 장치 및 그 방법
JP2012523463A (ja) * 2009-04-13 2012-10-04 ノースウエスタン ユニバーシティ 軟骨再生のための新規なペプチドベースの足場およびその使用方法
CN102023989B (zh) * 2009-09-23 2012-10-10 阿里巴巴集团控股有限公司 一种信息检索方法及其系统
US8150859B2 (en) * 2010-02-05 2012-04-03 Microsoft Corporation Semantic table of contents for search results
US8788260B2 (en) * 2010-05-11 2014-07-22 Microsoft Corporation Generating snippets based on content features
US9069754B2 (en) * 2010-09-29 2015-06-30 Rhonda Enterprises, Llc Method, system, and computer readable medium for detecting related subgroups of text in an electronic document
CN102737017B (zh) * 2011-03-31 2015-03-11 北京百度网讯科技有限公司 一种提取页面主题的方法和装置
US8799967B2 (en) * 2011-10-25 2014-08-05 At&T Intellectual Property I, L.P. Using video viewing patterns to determine content placement
US9495357B1 (en) * 2013-05-02 2016-11-15 Athena Ann Smyros Text extraction
US9575958B1 (en) * 2013-05-02 2017-02-21 Athena Ann Smyros Differentiation testing
US9262510B2 (en) * 2013-05-10 2016-02-16 International Business Machines Corporation Document tagging and retrieval using per-subject dictionaries including subject-determining-power scores for entries
US9251136B2 (en) 2013-10-16 2016-02-02 International Business Machines Corporation Document tagging and retrieval using entity specifiers
US9235638B2 (en) 2013-11-12 2016-01-12 International Business Machines Corporation Document retrieval using internal dictionary-hierarchies to adjust per-subject match results
US10303745B2 (en) * 2014-06-16 2019-05-28 Hewlett-Packard Development Company, L.P. Pagination point identification
US9971760B2 (en) * 2014-12-22 2018-05-15 International Business Machines Corporation Parallelizing semantically split documents for processing
CN104572927B (zh) * 2014-12-29 2016-06-29 北京奇虎科技有限公司 一种从单页面中提取小说名称的方法和装置
US11379538B1 (en) * 2016-05-19 2022-07-05 Artemis Intelligence Llc Systems and methods for automatically identifying unmet technical needs and/or technical problems
US11392651B1 (en) 2017-04-14 2022-07-19 Artemis Intelligence Llc Systems and methods for automatically identifying unmet technical needs and/or technical problems
CN111742322A (zh) 2017-12-29 2020-10-02 罗伯特·博世有限公司 用于使用深度神经网络来进行独立于领域和语言的定义提取的系统和方法
US11645110B2 (en) 2019-03-13 2023-05-09 International Business Machines Corporation Intelligent generation and organization of user manuals
CN110110326B (zh) * 2019-04-25 2020-10-27 西安交通大学 一种基于主题信息的文本切割方法
US11762916B1 (en) 2020-08-17 2023-09-19 Artemis Intelligence Llc User interface for identifying unmet technical needs and/or technical problems
US11947892B1 (en) * 2021-03-15 2024-04-02 Claimably Llc Plain-text analysis of complex document formats
CN117708434B (zh) * 2024-01-09 2024-06-28 青岛睿哲信息技术有限公司 一种基于关键词的用户推荐浏览内容生成方法

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3691844B2 (ja) * 1990-05-21 2005-09-07 株式会社東芝 文書処理方法
US5392428A (en) * 1991-06-28 1995-02-21 Robins; Stanford K. Text analysis system
CA2078423C (en) * 1991-11-19 1997-01-14 Per-Kristian Halvorsen Method and apparatus for supplementing significant portions of a document selected without document image decoding with retrieved information
US5642520A (en) 1993-12-07 1997-06-24 Nippon Telegraph And Telephone Corporation Method and apparatus for recognizing topic structure of language data
US5799268A (en) * 1994-09-28 1998-08-25 Apple Computer, Inc. Method for extracting knowledge from online documentation and creating a glossary, index, help database or the like
US5887120A (en) * 1995-05-31 1999-03-23 Oracle Corporation Method and apparatus for determining theme for discourse
US5913185A (en) * 1996-08-19 1999-06-15 International Business Machines Corporation Determining a natural language shift in a computer document
US6038560A (en) * 1997-05-21 2000-03-14 Oracle Corporation Concept knowledge base search and retrieval system
US6070133A (en) * 1997-07-21 2000-05-30 Battelle Memorial Institute Information retrieval system utilizing wavelet transform

Also Published As

Publication number Publication date
CA2370032A1 (en) 2000-10-19
WO2000062194A2 (en) 2000-10-19
IL145874A (en) 2006-12-31
US6473730B1 (en) 2002-10-29
AU768495B2 (en) 2003-12-11
EP1208456A2 (en) 2002-05-29
IL145874A0 (en) 2002-07-25
AU4233400A (en) 2000-11-14
WO2000062194A3 (en) 2002-03-21

Similar Documents

Publication Publication Date Title
HK1047173A1 (zh) 用於主題分段,段意義和段操作的方法和系統
ZA200205634B (en) Sourcing system and method.
EP1049088A4 (en) SYSTEM, DEVICE AND METHOD FOR PROCESSING INFORMATION
GB0208759D0 (en) Transaction system and method therefor
AU4317699A (en) Device, method, and system for clothing organization
HK1041398A1 (en) Method and apparatus for minimizing overhead in a communication system.
MXPA01007779A (es) Metodo para efectuar pagos sin efectivo y un sistema para implementar el metodo.
HK1045777A1 (zh) 用於迷你向導實施的系統與方法
IL141733A0 (en) System and method for low latency communication
GB2354609B (en) Method and system for predicting transactions
IL134554A0 (en) System and method for sharing bookmark information
GB2352851B (en) Search system and method based on search condition combinations
AU2108101A (en) Information gateway system and method
IL140019A0 (en) Communication system and communication method
ZA200204150B (en) Communication method and system.
IL130167A0 (en) Shopping system and method
IL135955A0 (en) Information processing system and information processing method thereof
SG92668A1 (en) Information processing system
AU6116700A (en) Network-based transaction system and method
GB0029202D0 (en) Biometrics system and method
HK1033871A1 (en) Information processing device, its method and information processing system and medium thereof
AU4145400A (en) Information processing method, information processing system and information processor
AU2064001A (en) System and method for metabolic profiling
AU6503700A (en) Flow-through capacitor, system and method
AU3079900A (en) Internet-advertising method and system therefor