CN109086344A - Full-text retrieval method for cloud computing platform - Google Patents

Info

Publication number
CN109086344A
Authority
CN
China
Prior art keywords
search
request
search engine
full
data
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810760927.3A
Other languages
Chinese (zh)
Inventor
郑向阳
钟送来
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Xianyu Fansheng Information Technology Co ltd
Original Assignee
Guangzhou Xianyu Fansheng Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2018-07-12
Filing date
2018-07-12
Publication date
2018-12-25
Application filed by Guangzhou Xianyu Fansheng Information Technology Co ltd
Priority to CN201810760927.3A
Publication of CN109086344A

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A full-text retrieval method for a cloud computing platform. The cloud computing platform comprises a data storage server and a search server; a search engine system is arranged on the search server and comprises a collector and a plurality of search engine nodes. The full-text retrieval method comprises the following steps: S1, the collector derives a plurality of search requests from the initial retrieval request submitted by a user and distributes the search requests to the plurality of search engine nodes; and S2, each search engine node receives a search request and, according to that search request, sends an index establishment request to a message bus. The method can provide e-commerce applications with real-time, highly concurrent, low-latency search that supports sorting by a variety of complex conditions.

Description

A full-text search method for a cloud computing platform
Technical field
The present invention relates to the field of computer technology, and in particular to a full-text search method for a cloud computing platform.
Background
With the development of information technology, the internet plays an increasingly important role in people's lives and work, and more and more people are using it. In e-commerce fields such as household appliances, digital products, medicine, clothing, audio and video, home furnishings, and retail in particular, large-scale e-commerce systems have come into use and analyze users' shopping preferences with cloud computing platforms and big data technology. As the number of users keeps growing, which in essence means a growing volume of data, storage and applications for massive data are flourishing, and the demands placed on retrieval in cloud computing platforms keep rising.
In applications that search massive data, full-text search is one of the most common statistical functions, and its availability and efficiency directly affect business applications and user experience. Full-text search is a method of retrieving text material by matching the words used in a document against search terms, and it makes related statistics and analysis of data convenient. However, existing full-text search methods applied to cloud computing platforms suffer from problems such as long search delays, scattered search results, and poor scalability and availability.
Summary of the invention
To overcome the above shortcomings of the prior art, the object of the present invention is to provide a full-text search method for a cloud computing platform that can provide e-commerce applications with real-time, highly concurrent, low-latency search supporting sorting by a variety of complex conditions.
The technical solution adopted by the present invention to solve this technical problem is a full-text search method for a cloud computing platform. The cloud computing platform includes a data storage server and a search server; the search server is provided with a search engine system, and the search engine system includes a collector and multiple search engine nodes. The full-text search method includes the following steps:
S1. The collector derives several search requests from the initial retrieval request submitted by a user and distributes the search requests to the multiple search engine nodes;
S2. Each search engine node receives a search request and, according to the search request, sends an index establishment request to the message bus;
S3. The message bus receives the index establishment request and distributes it to multiple data storage systems of the data storage server;
S4. Each data storage system performs a full-text search on its stored data according to the index establishment request and feeds the search result back to the search engine node through the message bus;
S5. The search engine node sends the search result to the collector, and the collector aggregates the results and feeds them back to the user.
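The flow of steps S1 to S5 can be illustrated with a minimal single-process sketch. Everything named here is an assumption made for illustration: the in-memory `STORES`, the whitespace split used to derive search requests, the thread pool standing in for parallel search engine nodes, and the match-count ordering used when aggregating. The patent does not specify any of these details.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins for the data storage systems (S4).
STORES = [
    {"doc1": "red cotton shirt", "doc2": "blue denim jacket"},
    {"doc3": "red leather shoes", "doc4": "cotton socks"},
]


def storage_search(store, keyword):
    """S4: full-text search of one data storage system for a keyword."""
    return [doc_id for doc_id, text in store.items() if keyword in text.split()]


def message_bus(index_request):
    """S3: distribute the index establishment request to every storage system."""
    hits = []
    for store in STORES:
        hits.extend(storage_search(store, index_request["keyword"]))
    return hits


def search_engine_node(search_request):
    """S2: turn a search request into an index establishment request."""
    return message_bus({"keyword": search_request})


def collector(initial_request):
    """S1 and S5: split the request, fan out to nodes, aggregate the results."""
    search_requests = initial_request.split()  # assumed splitting rule
    with ThreadPoolExecutor(max_workers=4) as pool:
        per_node = list(pool.map(search_engine_node, search_requests))
    # Aggregate: count how many search requests each document matched.
    scores = {}
    for hits in per_node:
        for doc_id in hits:
            scores[doc_id] = scores.get(doc_id, 0) + 1
    # Assumed ordering: most matches first, then document id.
    return sorted(scores, key=lambda d: (-scores[d], d))
```

For example, `collector("red cotton")` fans out two search requests and returns the document that matches both before the documents that match only one.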
Preferably, the initial retrieval request includes a retrieval sentence, and each search request includes at least one retrieval keyword. Step S1 further comprises the following steps:
S11. The collector receives the retrieval sentence and analyzes its content to obtain several search terms;
S12. Determine whether the search terms contain meaningless auxiliary words and, if so, delete them; then determine whether the remaining search terms contain associated words and, if so, combine the associated words;
S13. Take the combined associated words, together with the other remaining search terms, as the retrieval keywords, and distribute the search requests to the multiple search engine nodes.
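A minimal sketch of steps S11 to S13 follows. The word lists `AUXILIARY_WORDS` and `ASSOCIATED_WORDS` are hypothetical, whitespace splitting stands in for real word segmentation, and treating associated words as adjacent pairs is an assumption; the patent does not say how either kind of word is recognized.

```python
# Hypothetical word lists; a real system would rely on a tokenizer and dictionaries.
AUXILIARY_WORDS = {"a", "an", "the", "of", "for", "to"}
ASSOCIATED_WORDS = {("mobile", "phone"), ("washing", "machine")}


def analyze(retrieval_sentence):
    """Derive retrieval keywords from a retrieval sentence (S11 to S13)."""
    # S11: split the retrieval sentence into search terms.
    terms = retrieval_sentence.lower().split()
    # S12, first half: delete meaningless auxiliary words.
    terms = [t for t in terms if t not in AUXILIARY_WORDS]
    # S12, second half: combine adjacent associated words into one term.
    keywords = []
    i = 0
    while i < len(terms):
        if i + 1 < len(terms) and (terms[i], terms[i + 1]) in ASSOCIATED_WORDS:
            keywords.append(terms[i] + " " + terms[i + 1])
            i += 2
        else:
            keywords.append(terms[i])
            i += 1
    # S13: combined terms plus the remaining terms are the retrieval keywords.
    return keywords
```

With these lists, `analyze("The case for a mobile phone")` drops the auxiliary words and keeps "mobile phone" together as a single keyword.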
Preferably, each search engine node includes multiple search engine node instances, and step S2 further comprises the following steps:
S21. The search engine node distributes the retrieval keywords of the received search request to the multiple search engine node instances it contains, and each search engine node instance derives an index establishment request from its retrieval keywords;
S22. Determine whether the number of search requests exceeds the number of search engine nodes and, if so, increase the number of search engine nodes;
S23. Determine whether the number of retrieval keywords exceeds the number of search engine node instances and, if so, increase the number of search engine node instances.
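The scaling checks in S22 and S23 can be sketched as below. The class names, the starting counts, and the choice to grow to exactly the required number are assumptions for illustration; the patent only states that the counts are increased when they are exceeded.

```python
from dataclasses import dataclass, field


@dataclass
class SearchEngineNode:
    instances: int = 2  # hypothetical starting number of node instances


@dataclass
class SearchEngineSystem:
    nodes: list = field(default_factory=lambda: [SearchEngineNode()])

    def scale_for(self, search_requests):
        """search_requests: one list of retrieval keywords per search request."""
        # S22: more search requests than nodes, so add nodes.
        while len(search_requests) > len(self.nodes):
            self.nodes.append(SearchEngineNode())
        # S23: more keywords in a request than instances on its node,
        # so add instances to that node.
        for node, keywords in zip(self.nodes, search_requests):
            if len(keywords) > node.instances:
                node.instances = len(keywords)
```

Starting from one node with two instances, two search requests carrying three keywords and one keyword lead to two nodes, the first of which grows to three instances.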
Preferably, the data storage server is provided with a data storage system composed of a Key-Value data object storage engine and a Key-List data queue storage engine. The Key-List data queue storage engine includes multiple lists; each list includes multiple labels, each label includes multiple objects, and the labels use a block chain structure. Step S4 further comprises: searching the multiple labels for the content of the index establishment request, which includes the retrieval keywords; if a label matches, searching the objects of that label; and determining the index relation corresponding to the index establishment request according to the storage path of the label and the position of the object within that label.
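The two-level lookup described for step S4 can be sketched as follows. The nested dictionary `KEY_LIST_STORE`, the path format `list/label`, and the matching rules (a label matches when it equals a keyword; an object matches when it contains every keyword) are assumptions, since the patent does not define them. The block chain structure of the labels is not modeled.

```python
# Hypothetical Key-List store: list name -> label -> ordered objects.
# The block chain structure of the labels is not modeled here.
KEY_LIST_STORE = {
    "products": {
        "shirt": ["red cotton shirt", "blue linen shirt"],
        "shoes": ["red leather shoes"],
    },
    "reviews": {
        "shirt": ["shirt fits well"],
    },
}


def find_index_relations(keywords):
    """Return (label storage path, object position) pairs for matching objects."""
    relations = []
    for list_name, labels in KEY_LIST_STORE.items():
        for label, objects in labels.items():
            # First level: search the labels for a retrieval keyword.
            if label not in keywords:
                continue
            path = f"{list_name}/{label}"
            # Second level: search only the objects of the matching label.
            for position, obj in enumerate(objects):
                if all(k in obj.split() for k in keywords):
                    relations.append((path, position))
    return relations
```

Each returned pair plays the role of an index relation: the label's storage path plus the object's position inside that label.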
Compared with the prior art, the beneficial effects of the present invention are as follows. The full-text search method for a cloud computing platform supports distributed search and real-time search. Multiple search engine nodes search in parallel, so the method handles high concurrency and achieves low latency, and because the collector distributes the search requests and aggregates the search results, searches sorted by a variety of complex conditions can be implemented. In addition, the number of search engine nodes can be increased as needed, giving high scalability and high availability.
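One way the collector could aggregate per-node results under a compound sort condition is a k-way merge of sorted partial results. This is a sketch under assumptions, not the patent's stated mechanism: the hit fields `score` and `price`, the ordering in `sort_key`, and the `limit` parameter are all hypothetical.

```python
import heapq
from itertools import islice


def sort_key(hit):
    # Hypothetical compound condition: higher score first, then lower price.
    return (-hit["score"], hit["price"])


def summarize(per_node_results, limit=10):
    """Merge the partial results of several search engine nodes into one ranking."""
    # Each node's partial result is sorted on its own...
    streams = [sorted(hits, key=sort_key) for hits in per_node_results]
    # ...and the collector merges the sorted streams lazily.
    merged = heapq.merge(*streams, key=sort_key)
    return [hit["id"] for hit in islice(merged, limit)]
```

Because `heapq.merge` is lazy, the collector only consumes as many hits as `limit` requires, which keeps aggregation cheap when many nodes return long result lists.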
Specific embodiments
The present invention is further described below with reference to an embodiment.
Embodiment: a full-text search method for a cloud computing platform. The cloud computing platform includes a data storage server and a search server; the search server is provided with a search engine system, and the search engine system includes a collector and multiple search engine nodes. The full-text search method includes the following steps:
S1. The collector derives several search requests from the initial retrieval request submitted by a user and distributes the search requests to the multiple search engine nodes;
S2. Each search engine node receives a search request and, according to the search request, sends an index establishment request to the message bus;
S3. The message bus receives the index establishment request and distributes it to multiple data storage systems of the data storage server;
S4. Each data storage system performs a full-text search on its stored data according to the index establishment request and feeds the search result back to the search engine node through the message bus;
S5. The search engine node sends the search result to the collector, and the collector aggregates the results and feeds them back to the user.
The initial retrieval request includes a retrieval sentence, and each search request includes at least one retrieval keyword. Step S1 further comprises the following steps:
S11. The collector receives the retrieval sentence and analyzes its content to obtain several search terms;
S12. Determine whether the search terms contain meaningless auxiliary words and, if so, delete them; then determine whether the remaining search terms contain associated words and, if so, combine the associated words;
S13. Take the combined associated words, together with the other remaining search terms, as the retrieval keywords, and distribute the search requests to the multiple search engine nodes.
Each search engine node includes multiple search engine node instances, and step S2 further comprises the following steps:
S21. The search engine node distributes the retrieval keywords of the received search request to the multiple search engine node instances it contains, and each search engine node instance derives an index establishment request from its retrieval keywords;
S22. Determine whether the number of search requests exceeds the number of search engine nodes and, if so, increase the number of search engine nodes;
S23. Determine whether the number of retrieval keywords exceeds the number of search engine node instances and, if so, increase the number of search engine node instances.
The data storage server is provided with a data storage system composed of a Key-Value data object storage engine and a Key-List data queue storage engine. The Key-List data queue storage engine includes multiple lists; each list includes multiple labels, each label includes multiple objects, and the labels use a block chain structure. Step S4 further comprises: searching the multiple labels for the content of the index establishment request, which includes the retrieval keywords; if a label matches, searching the objects of that label; and determining the index relation corresponding to the index establishment request according to the storage path of the label and the position of the object within that label.
In summary, any other corresponding variations that a person skilled in the art, after reading this document, makes according to the technical solution and technical concept of the present invention without creative effort fall within the scope of protection of the present invention.

Claims (4)

1. A full-text search method for a cloud computing platform, the cloud computing platform including a data storage server and a search server, characterized in that the search server is provided with a search engine system, and the search engine system includes a collector and multiple search engine nodes; the full-text search method includes the following steps:
S1. The collector derives several search requests from the initial retrieval request submitted by a user and distributes the search requests to the multiple search engine nodes;
S2. Each search engine node receives a search request and, according to the search request, sends an index establishment request to the message bus;
S3. The message bus receives the index establishment request and distributes it to multiple data storage systems of the data storage server;
S4. Each data storage system performs a full-text search on its stored data according to the index establishment request and feeds the search result back to the search engine node through the message bus;
S5. The search engine node sends the search result to the collector, and the collector aggregates the results and feeds them back to the user.
2. The full-text search method for a cloud computing platform according to claim 1, characterized in that the initial retrieval request includes a retrieval sentence, and each search request includes at least one retrieval keyword; step S1 further comprises the following steps:
S11. The collector receives the retrieval sentence and analyzes its content to obtain several search terms;
S12. Determine whether the search terms contain meaningless auxiliary words and, if so, delete them; then determine whether the remaining search terms contain associated words and, if so, combine the associated words;
S13. Take the combined associated words, together with the other remaining search terms, as the retrieval keywords, and distribute the search requests to the multiple search engine nodes.
3. The full-text search method for a cloud computing platform according to claim 2, characterized in that each search engine node includes multiple search engine node instances, and step S2 further comprises the following steps:
S21. The search engine node distributes the retrieval keywords of the received search request to the multiple search engine node instances it contains, and each search engine node instance derives an index establishment request from its retrieval keywords;
S22. Determine whether the number of search requests exceeds the number of search engine nodes and, if so, increase the number of search engine nodes;
S23. Determine whether the number of retrieval keywords exceeds the number of search engine node instances and, if so, increase the number of search engine node instances.
4. The full-text search method for a cloud computing platform according to claim 3, characterized in that the data storage server is provided with a data storage system composed of a Key-Value data object storage engine and a Key-List data queue storage engine; the Key-List data queue storage engine includes multiple lists, each list includes multiple labels, each label includes multiple objects, and the labels use a block chain structure; step S4 further comprises: searching the multiple labels for the content of the index establishment request, which includes the retrieval keywords; if a label matches, searching the objects of that label; and determining the index relation corresponding to the index establishment request according to the storage path of the label and the position of the object within that label.
CN201810760927.3A 2018-07-12 2018-07-12 Full-text retrieval method for cloud computing platform Pending CN109086344A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810760927.3A CN109086344A (en) 2018-07-12 2018-07-12 Full-text retrieval method for cloud computing platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810760927.3A CN109086344A (en) 2018-07-12 2018-07-12 Full-text retrieval method for cloud computing platform

Publications (1)

Publication Number Publication Date
CN109086344A (en) 2018-12-25

Family

ID=64837535

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810760927.3A Pending CN109086344A (en) 2018-07-12 2018-07-12 Full-text retrieval method for cloud computing platform

Country Status (1)

Country Link
CN (1) CN109086344A (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102375860A (en) * 2010-08-27 2012-03-14 盛乐信息技术(上海)有限公司 Cloud-based structured data search method
CN102722531A (en) * 2012-05-17 2012-10-10 北京大学 Query method based on regional bitmap indexes in cloud environment
CN105740472A (en) * 2016-03-14 2016-07-06 中国科学院计算技术研究所 Distributed real-time full-text search method and system

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111339421A (en) * 2020-02-28 2020-06-26 腾讯科技(深圳)有限公司 Information search method, device, equipment and storage medium based on cloud technology
CN111339421B (en) * 2020-02-28 2023-02-28 腾讯科技(深圳)有限公司 Information search method, device, equipment and storage medium based on cloud technology
CN111708935A (en) * 2020-05-15 2020-09-25 深圳市世强元件网络有限公司 Multi-node word segmentation system and method for keyword search

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 2018-12-25)
RJ01 Rejection of invention patent application after publication