CN100394424C - Search engine with two-dimensional linearly scalable parallel architecture - Google Patents
Search engine with two-dimensional linearly scalable parallel architecture
- Publication number
- CN100394424C (application CNB2004100368058A, CN200410036805A)
- Authority
- CN
- China
- Prior art keywords
- search
- nodes
- node
- document
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000012545 processing Methods 0.000 claims abstract description 16
- 238000005192 partition Methods 0.000 claims description 34
- 238000004891 communication Methods 0.000 claims description 16
- 238000000034 method Methods 0.000 claims description 16
- 238000001914 filtration Methods 0.000 claims description 6
- 230000006978 adaptation Effects 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 238000013480 data collection Methods 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 3
- 230000011218 segmentation Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 2
- 238000000638 solvent extraction Methods 0.000 description 2
- 241000239290 Araneae Species 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/953—Organization of data
- Y10S707/956—Hierarchical
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
- Error Detection And Correction (AREA)
- Image Processing (AREA)
- Charge And Discharge Circuits For Batteries Or The Like (AREA)
Abstract
Description
Claims (9)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
NO19992269 | 1999-05-10 | ||
NO992269A NO992269D0 (no) | 1999-05-10 | 1999-05-10 | Søkemotor med todimensjonalt skalerbart, parallell arkitektur |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB008101248A Division CN1153162C (zh) | 1999-05-10 | 2000-05-10 | Search engine with two-dimensional linearly scalable parallel architecture |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1652108A (zh) | 2005-08-10 |
CN100394424C (zh) | 2008-06-11 |
Family
ID=19903319
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB008101248A Expired - Lifetime CN1153162C (zh) | 1999-05-10 | 2000-05-10 | Search engine with two-dimensional linearly scalable parallel architecture |
CNB2004100368058A Expired - Lifetime CN100394424C (zh) | 1999-05-10 | 2000-05-10 | Search engine with two-dimensional linearly scalable parallel architecture |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB008101248A Expired - Lifetime CN1153162C (zh) | 1999-05-10 | 2000-05-10 | Search engine with two-dimensional linearly scalable parallel architecture |
Country Status (15)
Country | Link |
---|---|
US (1) | US7330857B1 (zh) |
EP (1) | EP1208465B1 (zh) |
JP (1) | JP3586429B2 (zh) |
KR (1) | KR100457830B1 (zh) |
CN (2) | CN1153162C (zh) |
AT (1) | ATE439639T1 (zh) |
AU (1) | AU761169B2 (zh) |
BR (1) | BR0010427B8 (zh) |
CA (1) | CA2373453C (zh) |
CZ (1) | CZ20014002A3 (zh) |
DE (1) | DE60042745D1 (zh) |
HK (1) | HK1047178A1 (zh) |
NO (1) | NO992269D0 (zh) |
RU (1) | RU2226713C2 (zh) |
WO (1) | WO2000068834A1 (zh) |
Families Citing this family (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7359951B2 (en) * | 2000-08-08 | 2008-04-15 | Aol Llc, A Delaware Limited Liability Company | Displaying search results |
NO315887B1 (no) | 2001-01-04 | 2003-11-03 | Fast Search & Transfer As | Methods for transmission and searching of video information |
US7379983B2 (en) | 2003-06-25 | 2008-05-27 | International Business Machines Corporation | Merging scalable nodes into single-partition merged system using service processors of nodes |
US7240064B2 (en) * | 2003-11-10 | 2007-07-03 | Overture Services, Inc. | Search engine with hierarchically stored indices |
US7672930B2 (en) * | 2005-04-05 | 2010-03-02 | Wal-Mart Stores, Inc. | System and methods for facilitating a linear grid database with data organization by dimension |
CN101369268B (zh) * | 2007-08-15 | 2011-08-24 | 北京书生国际信息技术有限公司 | Method for storing document data in a document library system |
US20080033925A1 (en) * | 2006-08-07 | 2008-02-07 | Bea Systems, Inc. | Distributed search analysis |
US7725470B2 (en) * | 2006-08-07 | 2010-05-25 | Bea Systems, Inc. | Distributed query search using partition nodes |
US9015197B2 (en) | 2006-08-07 | 2015-04-21 | Oracle International Corporation | Dynamic repartitioning for changing a number of nodes or partitions in a distributed search system |
US8321376B2 (en) * | 2007-03-29 | 2012-11-27 | Telefonaktiebolaget Lm Ericsson (Publ) | Address resolving database |
WO2009078729A1 (en) * | 2007-12-14 | 2009-06-25 | Fast Search & Transfer As | A method for improving search engine efficiency |
KR101009444B1 (ko) * | 2008-01-29 | 2011-01-19 | 김운현 | Continuous machining type angle head |
US20090254523A1 (en) * | 2008-04-04 | 2009-10-08 | Yahoo! Inc. | Hybrid term and document-based indexing for search query resolution |
US8825646B1 (en) * | 2008-08-08 | 2014-09-02 | Google Inc. | Scalable system for determining short paths within web link network |
US8392394B1 (en) * | 2010-05-04 | 2013-03-05 | Google Inc. | Merging search results |
EP2423830A1 (de) | 2010-08-25 | 2012-02-29 | Omikron Data Quality GmbH | Method for searching in a plurality of data records, and search engine |
US9529908B2 (en) | 2010-11-22 | 2016-12-27 | Microsoft Technology Licensing, Llc | Tiering of posting lists in search engine index |
US9195745B2 (en) | 2010-11-22 | 2015-11-24 | Microsoft Technology Licensing, Llc | Dynamic query master agent for query execution |
US9342582B2 (en) | 2010-11-22 | 2016-05-17 | Microsoft Technology Licensing, Llc | Selection of atoms for search engine retrieval |
US8478704B2 (en) | 2010-11-22 | 2013-07-02 | Microsoft Corporation | Decomposable ranking for efficient precomputing that selects preliminary ranking features comprising static ranking features and dynamic atom-isolated components |
US9424351B2 (en) | 2010-11-22 | 2016-08-23 | Microsoft Technology Licensing, Llc | Hybrid-distribution model for search engine indexes |
US8620907B2 (en) | 2010-11-22 | 2013-12-31 | Microsoft Corporation | Matching funnel for large document index |
US8713024B2 (en) | 2010-11-22 | 2014-04-29 | Microsoft Corporation | Efficient forward ranking in a search engine |
CN102436513B (zh) * | 2012-01-18 | 2014-11-05 | 中国电子科技集团公司第十五研究所 | Distributed retrieval method and system |
US20150120844A1 (en) * | 2013-10-31 | 2015-04-30 | International Business Machines Corporation | Hierarchical response-enabled notification system |
US10120938B2 (en) * | 2015-08-01 | 2018-11-06 | MapScallion LLC | Systems and methods for automating the transmission of partitionable search results from a search engine |
US10380207B2 (en) * | 2015-11-10 | 2019-08-13 | International Business Machines Corporation | Ordering search results based on a knowledge level of a user performing the search |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
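CN1162154A (zh) * | 1996-03-12 | 1997-10-15 | Matsushita Electric Industrial Co., Ltd. | Data search device |
CN1229218A (zh) * | 1998-03-17 | 1999-09-22 | Matsushita Electric Industrial Co., Ltd. | Information retrieval device and method |
JP2000010980A (ja) * | 1998-06-24 | 2000-01-14 | Nec Corp | Database search system, database search method, and recording medium |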
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4860201A (en) * | 1986-09-02 | 1989-08-22 | The Trustees Of Columbia University In The City Of New York | Binary tree parallel processor |
WO1992006436A2 (en) * | 1990-10-03 | 1992-04-16 | Thinking Machines Corporation | Parallel computer system |
US5701459A (en) * | 1993-01-13 | 1997-12-23 | Novell, Inc. | Method and apparatus for rapid full text index creation |
JP3266351B2 (ja) * | 1993-01-20 | 2002-03-18 | Hitachi, Ltd. | Database management system and query processing method |
US7599910B1 (en) * | 1993-11-16 | 2009-10-06 | Hitachi, Ltd. | Method and system of database divisional management for parallel database system |
US5742806A (en) * | 1994-01-31 | 1998-04-21 | Sun Microsystems, Inc. | Apparatus and method for decomposing database queries for database management system including multiprocessor digital data processing system |
US5694593A (en) * | 1994-10-05 | 1997-12-02 | Northeastern University | Distributed computer database system and method |
AU5386796A (en) * | 1995-04-11 | 1996-10-30 | Kinetech, Inc. | Identifying data in a data processing system |
CA2150745C (en) * | 1995-06-01 | 2001-05-01 | Chaitanya K. Baru | Method and apparatus for implementing partial declustering in a parallel database system |
US5960194A (en) * | 1995-09-11 | 1999-09-28 | International Business Machines Corporation | Method for generating a multi-tiered index for partitioned data |
US5926811A (en) * | 1996-03-15 | 1999-07-20 | Lexis-Nexis | Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching |
DE69632835T2 (de) * | 1996-04-29 | 2005-07-14 | Scientific Research Institute Of Different Branches "Integral" | Method for the automatic processing of information on user data |
RU2096825C1 (ru) * | 1996-10-14 | 1997-11-20 | Общество с ограниченной ответственностью "Информбюро" | Information processing device for information retrieval |
US6112198A (en) * | 1997-06-30 | 2000-08-29 | International Business Machines Corporation | Optimization of data repartitioning during parallel query optimization |
US6549519B1 (en) * | 1998-01-23 | 2003-04-15 | Alcatel Internetworking (Pe), Inc. | Network switching device with pipelined search engines |
JP3774324B2 (ja) * | 1998-08-03 | 2006-05-10 | Hitachi, Ltd. | Sort processing system and sort processing method |
US6370527B1 (en) * | 1998-12-29 | 2002-04-09 | At&T Corp. | Method and apparatus for searching distributed networks using a plurality of search devices |
-
1999
- 1999-05-10 NO NO992269A patent/NO992269D0/no unknown
-
2000
- 2000-05-10 DE DE60042745T patent/DE60042745D1/de not_active Expired - Lifetime
- 2000-05-10 AT AT00923028T patent/ATE439639T1/de not_active IP Right Cessation
- 2000-05-10 CN CNB008101248A patent/CN1153162C/zh not_active Expired - Lifetime
- 2000-05-10 CZ CZ20014002A patent/CZ20014002A3/cs unknown
- 2000-05-10 HK HK02108789.6A patent/HK1047178A1/zh unknown
- 2000-05-10 US US09/743,268 patent/US7330857B1/en not_active Expired - Lifetime
- 2000-05-10 JP JP2000616545A patent/JP3586429B2/ja not_active Expired - Lifetime
- 2000-05-10 EP EP00923028A patent/EP1208465B1/en not_active Expired - Lifetime
- 2000-05-10 BR BRPI0010427-2A patent/BR0010427B8/pt not_active IP Right Cessation
- 2000-05-10 CA CA002373453A patent/CA2373453C/en not_active Expired - Lifetime
- 2000-05-10 KR KR10-2001-7014313A patent/KR100457830B1/ko not_active Expired - Lifetime
- 2000-05-10 CN CNB2004100368058A patent/CN100394424C/zh not_active Expired - Lifetime
- 2000-05-10 AU AU43214/00A patent/AU761169B2/en not_active Expired
- 2000-05-10 WO PCT/NO2000/000155 patent/WO2000068834A1/en not_active Application Discontinuation
- 2000-05-10 RU RU2001133092/09A patent/RU2226713C2/ru not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
DE60042745D1 (de) | 2009-09-24 |
AU4321400A (en) | 2000-11-21 |
NO992269D0 (no) | 1999-05-10 |
CN1652108A (zh) | 2005-08-10 |
JP2002544598A (ja) | 2002-12-24 |
CZ20014002A3 (cs) | 2002-04-17 |
HK1047178A1 (zh) | 2003-02-07 |
RU2226713C2 (ru) | 2004-04-10 |
BR0010427B8 (pt) | 2013-02-19 |
KR20020006715A (ko) | 2002-01-24 |
ATE439639T1 (de) | 2009-08-15 |
BR0010427A (pt) | 2002-02-19 |
CN1153162C (zh) | 2004-06-09 |
CA2373453C (en) | 2005-08-16 |
AU761169B2 (en) | 2003-05-29 |
JP3586429B2 (ja) | 2004-11-10 |
WO2000068834A1 (en) | 2000-11-16 |
EP1208465B1 (en) | 2009-08-12 |
CN1360701A (zh) | 2002-07-24 |
US7330857B1 (en) | 2008-02-12 |
CA2373453A1 (en) | 2000-11-16 |
KR100457830B1 (ko) | 2004-11-18 |
EP1208465A1 (en) | 2002-05-29 |
BR0010427B1 (pt) | 2013-01-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100394424C (zh) | Search engine with two-dimensional linearly scalable parallel architecture | |
Tomasic et al. | Performance of inverted indices in shared-nothing distributed text document information retrieval systems | |
US6701317B1 (en) | Web page connectivity server construction | |
Hull | Document image matching and retrieval with multiple distortion-invariant descriptors | |
US8131724B2 (en) | System for similar document detection | |
Chen et al. | I/O-efficient techniques for computing PageRank | |
Al-Dhelaan et al. | A new strategy for processors allocation in an n-cube multiprocessor | |
Lee et al. | Metadata management of the SANtopia file system | |
Sharma | A generic machine for parallel information retrieval | |
Kargupta et al. | PADMA: Parallel data mining agents for scalable text classification | |
Boswell | Distributed High Performance Web Crawlers: A Survey of the State of the Art | |
Hawking et al. | A Parallel Document Retrieval Server for the World Wide Web | |
WO2009078729A1 (en) | A method for improving search engine efficiency | |
Mohammed et al. | Novel parallel join algorithms for grid files | |
Aktug et al. | Signature files: An integrated access method for formatted and unformatted databases | |
WO2008032340A2 (en) | Method and system for processing geometrical layout design data | |
NO313347B1 (no) | Search engine with two-dimensional linearly scalable, parallel architecture | |
Gao | A hierarchical document clustering algoritm | |
Sun et al. | Implementation of large-scale distributed information retrieval system | |
Ribeiro et al. | Distributed parallel generation of pat arrays | |
Daoud | Perfect hash functions for large dictionaries | |
Motzkin et al. | Parallel organization and performance of an information system | |
Ok | A multicomputer query processing model for full-text retrieval using compressed bitmaps | |
Catal | Parallel Text Retrieval on PC Clusters | |
Lemström | A Client-Server Extension to PICSearch system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right | Effective date of registration: 20090306; Address after: California, USA; Patentee after: Yahoo Corp.; Address before: California, USA; Patentee before: Overture Services Inc. |
ASS | Succession or assignment of patent right | Owner name: YAHOO CORP.; Free format text: FORMER OWNER: OVERTURE SERVICES INC.; Effective date: 20090306 |
ASS | Succession or assignment of patent right | Owner name: FEIYANG MANAGEMENT CO., LTD.; Free format text: FORMER OWNER: YAHOO CORP.; Effective date: 20150331 |
TR01 | Transfer of patent right | Effective date of registration: 20150331; Address after: Tortola, British Virgin Islands; Patentee after: Feiyang Management Co., Ltd.; Address before: California, USA; Patentee before: Yahoo Corp. |
CX01 | Expiry of patent term | Granted publication date: 20080611 |