CN101146152B - Information collection and search system for telecommunication information station - Google Patents
- Publication number: CN101146152B (application CN200610154206A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to an information collection and query system for the Best Tone (number wizard) service. The system comprises an information collection server that processes information input from various information sources, a data storage server that stores the system's data, and an information management and operation server that manages information, keyword tables and business rules and provides the information query service. The three servers are connected to one another over an IP network using the TCP protocol. Multiple information sources are collected and processed uniformly, structured and unstructured information are searched jointly, and the search results are ranked and displayed according to the business rules, so that information is collected and queried according to the user's query needs. The architecture and functional division of the system are concrete, reasonable and easy to implement, and offer good extensibility and flexibility.
Description
Technical field
The present invention relates to an information collection and query system and, more precisely, to the architecture of an information collection and search system for telecommunication information stations of the Best Tone type.
Background technology
Internet intelligent information collection (usually called a web crawler) and information search (usually called a search engine) have so far been applied well only in Internet search services. These technologies are mainly used to collect and store web content and to index and query unstructured text such as web pages. Given query conditions such as keywords, the system looks up matching information in an index database and displays it ranked according to preset rules.
The main business function of a telecommunication information station is to provide accurate information to users who dial the station's access code, so only a system with strong information collection and search capabilities can meet its requirements. Unlike web search, an information station uses a single access code for service access: a call center handles the user's incoming call, and a queuing machine distributes the call to an agent. For indexing and querying, the Best Tone service holds a large amount of structured information (data stored according to a fixed format and fixed requirements) and relatively little unstructured information (text data); the two can be associated through keywords. Mobile and fixed-line telephones are now very widespread. By transforming the existing Best Tone business system and building a new information collection and search system on a more flexible and extensible architecture, the information-operation needs of the Best Tone service can be met.
The information collection and search system is an important component of the business support system of a telecommunication information station. It plays an important role in enriching the station's business information, improving query efficiency and accuracy, reducing the rate of queries that return no result, and launching new services such as industry first-lookup. The system therefore needs essential functions such as information collection, processing, storage, publishing, indexing, querying and business rule management. To realize these functions, the system must support obtaining data from various information sources, support collaborative definition of content workflows, perform joint search over structured and unstructured data, and rank and display the search results as the business rules require. The software system therefore needs a well-designed architecture to meet the needs of the station's business development; a soundly designed software architecture matters greatly to the actual operation of the whole system.
Summary of the invention
The object of the present invention is to provide a novel information collection and search system for the voice information query service of Best Tone. The architecture and functional division of the system are concrete, reasonable and easy to implement, with good extensibility and flexibility.
To achieve this object, an information collection and query system for the Best Tone service is provided. The system comprises at least an information collection server, a data storage server, and an information management and operation server, wherein: the information collection server processes information input from various information sources and is connected to the information management and operation server over an IP network using the TCP protocol; the information management and operation server manages information, keyword tables and business rules, provides the information query service, and is connected to the data storage server and the information collection server over an IP network using the TCP protocol; and the data storage server stores the various data of the system;
The data storage server further comprises: a service information database, which is a standard relational database that stores all data required by the information station's business, the keyword tables and the business rules, and implements the related search functions; a data conversion/synchronization gateway, which synchronously replicates the structured and unstructured content data in the service information database to the full-text index database; and a full-text index database, which builds a full-text index over the synchronized structured and unstructured content data and provides a keyword retrieval interface;
Through the information collection server, the data storage server, and the information management and operation server, the system collects and processes multiple information sources uniformly, performs joint search over structured and unstructured information, and ranks and displays the search results according to the business rules, so that information is collected and queried according to the user's query needs.
Preferably, the information management and operation server in the information collection and query system can be connected to a plurality of information collection servers, and sends the configuration parameters for information collection to each information collection server over TCP. The information collection server further comprises: an Internet information acquisition module, which is configured by the information collection server according to the received parameters, crawls the content of Internet websites according to those parameters, and sends the data to the information management and operation server over TCP; an integrated business support system (IBSS) number information change processing module, which handles changed information arriving from the IBSS, such as numbers, organization names and addresses, and formats it; a form input module, which handles manually entered structured information, defines the input fields by industry, and checks the format of the entered content; a task input module, for entering the specific content of an information collector's information search task; and an SP/CP information input interface module, which handles information arriving from SP/CP systems and converts the data format to XML.
The parameters used by the Internet information acquisition module include: the uniform resource locator (URL), the acquisition time, the search depth, the search scope, the website login parameters, the information classification keywords, and so on. The Internet information acquisition module also dynamically collects, in real time, information from industry websites on the Internet; together with the supplementary information sources added by the other modules, this expands the original number information with related value-added information and builds the Best Tone service information database.
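As an illustration of how the acquisition parameters listed above might be packaged and sent over a TCP stream, the following is a minimal sketch. The patent does not specify a wire format; the class name `CrawlConfig`, the field names, and the length-prefixed JSON encoding are all assumptions made for this example.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class CrawlConfig:
    """Hypothetical container for the acquisition parameters named in the text."""
    url: str
    acquisition_time: str          # e.g. "02:00"; the format is an assumption
    search_depth: int
    search_scope: str
    login: dict = field(default_factory=dict)
    category_keywords: list = field(default_factory=list)


def encode_config(cfg: CrawlConfig) -> bytes:
    """Serialize the config as a 4-byte big-endian length followed by UTF-8 JSON."""
    payload = json.dumps(asdict(cfg), ensure_ascii=False).encode("utf-8")
    return len(payload).to_bytes(4, "big") + payload


def decode_config(message: bytes) -> CrawlConfig:
    """Inverse of encode_config: read the length prefix, then parse the JSON body."""
    length = int.from_bytes(message[:4], "big")
    body = message[4:4 + length].decode("utf-8")
    return CrawlConfig(**json.loads(body))
```

The length prefix is one common way to delimit messages on a byte stream such as TCP; any other framing would serve equally well here.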
Preferably, the information management and operation server in the information collection and query system further comprises: an information collection module, which processes the information sent by the information collection server; a system management module, which manages system access rights, information processing workflows and information content; a keyword management module, which manages the content of the keyword table and the business rules related to keywords, binds keywords to business rules, and manages keyword sales; a statistical analysis module, which compiles statistics on the information in the integrated information database according to predefined rules; a user query history module, which displays the questions a given user has recently asked to help the agent analyze the user's needs, and also analyzes the questions asked by all users to discover user demand and popular queries; and a keyword retrieval module, which performs retrieval by keyword.
The information collection module introduces an information credibility model. The model evaluates the credibility of information from parameters such as the importance of the industry the information belongs to, its publication time, the website it belongs to, and the number of links to the web page, and information with high credibility is processed first.
The information credibility model is a processing module that evaluates the reliability of information. It is a preprocessing module of the information collection module: it evaluates and scores the information entering the information collection module so that information collection staff can process important information first.
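The scoring-and-prioritizing idea can be sketched as a weighted sum over the parameters the text names. The patent gives no formula, so the weights, the freshness decay, the link cap, and every name below are assumptions chosen only to make the example concrete.

```python
from dataclasses import dataclass


@dataclass
class Item:
    """Hypothetical collected item with the parameters named in the text."""
    title: str
    industry_importance: float  # 0..1, importance of the item's industry
    age_days: float             # days since publication
    site_weight: float          # 0..1, trust assigned to the source website
    inbound_links: int          # number of links pointing at the page


def credibility(item: Item) -> float:
    """Assumed scoring rule: weighted sum of four normalized factors (0..1)."""
    freshness = 1.0 / (1.0 + item.age_days / 30.0)      # newer items score higher
    link_score = min(item.inbound_links, 100) / 100.0   # cap link count at 100
    return (0.4 * item.industry_importance
            + 0.2 * freshness
            + 0.2 * item.site_weight
            + 0.2 * link_score)


def prioritize(items):
    """Return items ordered so the most credible are handled first."""
    return sorted(items, key=credibility, reverse=True)
```

In practice the weights would be tuned by the operator; the point is only that scoring happens before the items reach the editing staff.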
Preferably, the processing performed by the information collection module comprises: automated intelligent processing of information, including automatic classification, automatic deduplication and field analysis; and manual processing of information.
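Deduplication by content similarity can be sketched with a configurable threshold. The patent does not name a similarity measure; this example assumes Python's standard `difflib.SequenceMatcher` ratio and a default threshold of 0.8, and the function names are hypothetical.

```python
from difflib import SequenceMatcher


def is_duplicate(a: str, b: str, threshold: float = 0.8) -> bool:
    """Treat two texts as duplicates when their similarity ratio meets the threshold."""
    return SequenceMatcher(None, a, b).ratio() >= threshold


def deduplicate(texts, threshold: float = 0.8):
    """Keep the first occurrence of each group of near-identical texts."""
    kept = []
    for text in texts:
        if not any(is_duplicate(text, seen, threshold) for seen in kept):
            kept.append(text)
    return kept
```

This pairwise comparison is quadratic in the number of items; a production system would more likely use hashing or shingling, which the patent does not discuss.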
The keyword retrieval module further comprises: a second service information database, corresponding to the service information database of the system, which stores and manages the keyword table, business rules, merchant contracts (merchant ID, keyword ID, weight) and user query behavior records of each local network; and a full-text database, corresponding to the full-text index database of the system, which stores the audited merchant information and Internet value-added information of each local network.
The keyword retrieval module further comprises: a business rule engine module, which normalizes and segments the query submitted by the user into words, generates query conditions in combination with the business rules stored in the service information database, and then searches the data in the full-text database with those conditions; and a retrieval ranking engine module, which ranks and displays the search results according to the business rules.
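Ranking by business rule can be sketched using the merchant contract triple (merchant ID, keyword ID, weight) mentioned above. The ordering rule used here, contract weight first and text relevance as a tie-breaker, is an assumption, as are the names `Contract` and `rank_results`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Contract:
    """Merchant contract as described in the text: (merchant ID, keyword ID, weight)."""
    merchant_id: str
    keyword_id: str
    weight: float


def rank_results(hits, contracts, keyword_id):
    """Order full-text hits by contract weight for the keyword, then by relevance.

    hits: list of (merchant_id, relevance) pairs returned by the full-text search.
    Merchants without a contract for this keyword get weight 0.
    """
    weights = {c.merchant_id: c.weight
               for c in contracts if c.keyword_id == keyword_id}
    return sorted(hits, key=lambda h: (-weights.get(h[0], 0.0), -h[1]))
```

Keeping the weights in the relational database and the hits in the full-text index mirrors the split between the two stores described in the text.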
The advantages of the present invention are as follows.
(1) Independence from the call center's speech processing equipment. The system handles the processing and search of information content and leaves speech processing to the call center platform. It can therefore be integrated easily with various call center platforms and is widely applicable and general.
(2) Support for multiple, flexibly configured information collection servers. Each information management and operation server can support one or more information collection servers, so information collection servers can be configured flexibly as business growth increases the demand for information and as Internet access conditions change. For example, information collection servers can be added or removed at any time according to user needs.
(3) Good maintainability. To modify the processing flow for one category of information, only that category's flow needs to be changed, and the change does not affect the flows of other categories. A processing flow for a new category of information can likewise be added through configuration in the system.
(4) Scalable collection and search performance. Depending on the circumstances, the software modules of the information collection server, the data storage server, and the information management and operation server can be deployed on different computers, so that the processing capacity of the whole system can be scaled.
Implementing the system plays an important role in enriching the information resources of the Best Tone information station. It also raises the level of information operation of Best Tone, helps the service shift from mainly number information to number plus multimedia information services, and provides information support for the development of the Best Tone business.
Description of drawings
These and other features, advantages and benefits of the present invention will become clearer from the following description of preferred embodiments, given by way of non-limiting example, and from the accompanying drawings, in which:
Fig. 1 is a schematic diagram of the structure of the information collection and search system for a telecommunication information station according to the present invention;
Fig. 2 is a schematic diagram of the structure of the information search and result processing module according to the present invention, which is a concrete implementation of the keyword retrieval module of Fig. 1;
Fig. 3 is a schematic diagram of the overall information processing flow of the information collection and search system for a telecommunication information station according to the present invention.
Embodiment
Preferred embodiments of the present invention are described below with reference to the drawings. It should be understood that the preferred embodiments described here are not limiting; those skilled in the art can, following the principles of the invention, make various modifications and improvements without departing from the scope of protection defined by the appended claims.
The object of the present invention is to provide a novel information collection and search system for the voice information query service of Best Tone. The architecture and functional division of the system are concrete, reasonable and easy to implement, with good extensibility and flexibility.
Technically, the information collection and search system according to the present invention, for telecommunication information stations such as the voice information query service of Best Tone, belongs to the field of Internet intelligent information collection and information search. An intelligent web information collection server automatically searches the Internet for the content the information station needs; the content is then processed, managed, stored and published according to defined content processing flows; and operators use a search engine to look up the information needed by users who dial the information station.
On the basis of this system architecture, the present invention also provides a method for uniformly storing and indexing structured and unstructured information. Structured information is placed in XML (Extensible Markup Language) files and stored together with the unstructured information in a central information repository, and a unified index is then generated over all of this information.
Preferred embodiments of the present invention are described below with reference to the drawings to explain how the invention is implemented.
Fig. 1 shows the structure of the information collection and search system for a telecommunication information station according to the present invention. Preferably, the invention is an information collection and search system applied to telecommunication information stations of the Best Tone type. The system comprises:
An information collection server 101, which processes the input of various information sources and is connected to the information management and operation server 103 over an IP network. The information management and operation server 103 sends the configuration parameters for information collection to the information collection server 101 over the IP (Internet Protocol) network using TCP (Transmission Control Protocol). The information collection server 101 configures the Internet information acquisition module 1011 according to the received parameters, and the Internet information acquisition module 1011 crawls the content of Internet websites according to those parameters and sends the data to the information management and operation server 103 over TCP. The Internet information acquisition module 1011 dynamically collects, in real time, information from industry websites on the Internet; together with the supplementary information sources added by the other modules, this expands the original number information with related value-added information and builds the Best Tone service information database.
Besides the Internet information acquisition module 1011, the information collection server 101 also comprises: an IBSS (Integrated Business Support System) number information change processing module 1012, a form input module 1013, a task input module 1014, and an SP/CP (Service Provider/Content Provider) information input interface module 1015. The IBSS number information change processing module 1012 handles changed information arriving from the IBSS, such as numbers, organization names and addresses, and formats it. The form input module 1013 handles manually entered structured information; it defines the input fields by industry and checks the format of the entered content. The task input module 1014 is used to enter the specific content of an information collector's information search task; for example, a given collector's task may be to complete a survey of catering merchants in a given community within a given time. The SP/CP information input interface module 1015 handles information arriving from SP/CP systems and converts the data format to XML (shown below). These modules send their results to the information management and operation server 103 over TCP.
The data is encapsulated in the following XML format and transmitted over standard TCP:
<?xml version="1.0"?>
<contentdata version="1.0" timestamp="">
<structure>
<unitname>
<namestring></namestring>
<alias></alias>
<level></level>
</unitname>
<callnumber>
<first></first>
<second></second>
</callnumber>
<address></address>
<linkman></linkman>
</structure>
<expand>
<trade></trade>
<comment></comment>
<general_situation></general_situation>
…
</expand>
</contentdata>
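A receiving component could read such a record with a standard XML parser. The sketch below uses Python's `xml.etree.ElementTree` on a well-formed instance of the template; all sample values, the `parse_record` function, and the keys of the returned dictionary are hypothetical and serve only as illustration.

```python
import xml.etree.ElementTree as ET

# Well-formed sample following the template; every value here is made up.
SAMPLE = """<?xml version="1.0"?>
<contentdata version="1.0" timestamp="2006-09-15T10:00:00">
  <structure>
    <unitname>
      <namestring>Example Restaurant</namestring>
      <alias>Example</alias>
      <level>1</level>
    </unitname>
    <callnumber>
      <first>5550100</first>
      <second>5550101</second>
    </callnumber>
    <address>1 Example Road</address>
    <linkman>Example Contact</linkman>
  </structure>
  <expand>
    <trade>catering</trade>
    <comment>open daily</comment>
  </expand>
</contentdata>"""


def parse_record(xml_text: str) -> dict:
    """Extract the structured fields and one extended field into a flat dict."""
    root = ET.fromstring(xml_text)
    structure = root.find("structure")
    return {
        "version": root.get("version"),
        "name": structure.findtext("unitname/namestring"),
        "numbers": [structure.findtext("callnumber/first"),
                    structure.findtext("callnumber/second")],
        "address": structure.findtext("address"),
        "contact": structure.findtext("linkman"),
        "trade": root.findtext("expand/trade"),
    }
```

The `structure` element carries the fixed directory fields, while `expand` carries the value-added fields, which matches the structured/unstructured split the description relies on.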
An information management and operation server 103, which manages information, keyword tables and business rules and provides the information query service, and which is connected to the data storage server 102 and the information collection server 101 over an IP network. The information management and operation server 103 stores the data sent by the information collection server 101 in the data storage server 102 over the network. The functional modules of the information management and operation server 103 process the data in the data storage server 102, and the results are also stored in the data storage server 102. The information management and operation server 103 can be connected to one or more information collection servers 101. If no cluster is configured, the information management and operation server 103 is connected to only one data storage server 102; otherwise it is connected to a plurality of data storage servers 102. The information management and operation server 103 comprises the following modules: an information collection module 1031, a system management module 1032, a keyword management module 1033, a statistical analysis module 1034, a user query history module 1035 and a keyword retrieval module 1036.
The information collection module 1031 processes the information sent by the information collection server 101. Specifically, the information collection module 1031 introduces an information credibility model, which evaluates the credibility of information from parameters such as the importance of the industry the information belongs to (judged by keyword), its publication time, the website it belongs to, and the number of links to the web page, so that information with high credibility is processed first. The information credibility model is a processing module that evaluates the reliability of information. It is a preprocessing module of the information collection module: it evaluates and scores the information entering the information collection module so that information collection staff can process important information first. The information collection module processes information as follows:
A) Automated intelligent processing of information
● Automatic classification
The system supports two classification techniques: automatic classification based on statistics and rule classification based on semantic rules. Automatic classification suits content-based classification needs, and rule classification suits keyword-based classification needs; combining the two gives users multi-level classification support. Users can choose either technique or a combination of both according to their needs.
First: content-based automatic text classification that needs no manual intervention. The system provides a classification training tool that lets users define a category structure according to their own classification needs and data characteristics, generates feature templates, and runs classification training automatically. Automatic classification supports a feedback learning mechanism that refines the classification model from user feedback, gradually increasing classification accuracy.
Second: rule-based text classification. Rules can be written with logical operations such as AND, OR and NOT, and can set word-frequency counts. The system also provides a convenient rule definition interface in which users can write and adjust rules as needed to reach the intended classification result.
● Automatic deduplication
Duplicates are detected by content similarity, and the deduplication criterion can be set, for example rejecting information whose content is 80% identical.
● Field analysis
Field analysis is performed on information collected from the Internet: useful information is filled into the corresponding fields of the record as required by the information structure.
B) Manual processing of information
Manual processing means that information staff use the system's web interface to screen, edit, process, organize, review and release information, and to move processed information from the raw information repository into the integrated information database. The review and release process is implemented as an information workflow.
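The rule-based classification described under A) can be sketched as composable predicates supporting AND, OR, NOT and a word-frequency threshold. The rule syntax of the actual system is not given in the text; the helper names (`term`, `all_of`, `any_of`, `negate`) and the sample rules are assumptions.

```python
def term(word, min_count=1):
    """Rule that matches when `word` occurs at least `min_count` times (case-insensitive)."""
    return lambda text: text.lower().count(word.lower()) >= min_count


def all_of(*rules):
    """Logical AND over rules."""
    return lambda text: all(rule(text) for rule in rules)


def any_of(*rules):
    """Logical OR over rules."""
    return lambda text: any(rule(text) for rule in rules)


def negate(rule):
    """Logical NOT of a rule."""
    return lambda text: not rule(text)


# Hypothetical rule set: category name -> rule.
RULES = {
    "catering": all_of(any_of(term("restaurant"), term("menu", 2)),
                       negate(term("hotel"))),
    "lodging": term("hotel"),
}


def classify(text, rules=RULES):
    """Return every category whose rule matches the text."""
    return [label for label, rule in rules.items() if rule(text)]
```

For instance, `classify("New restaurant opens downtown")` matches only the `catering` rule, while a text mentioning a hotel is routed to `lodging` because of the NOT clause.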
The system management module 1032 manages system access rights, information processing workflows and information content, as follows:
A) User rights management
This function manages the users of the system, including setting user roles and assigning the corresponding user rights. System users fall into three roles: system administrator, information staff, and service representative; a user may hold more than one role.
● System administrators are divided into provincial-company system administrators and branch-company system administrators. A provincial-company system administrator has the authority and responsibility to manage the whole system, and a branch-company system administrator has the authority and responsibility to manage the branch's subsystem. Their duties include maintaining the system and its data, monitoring system usage, managing user information, and assigning rights;
● Information staff comprise information processing staff and information collection staff. Information processing staff can process, browse and store information of specified categories (topics) and confidentiality levels; information collection staff mainly enter information and have permission to browse part of the information.
● Service representatives and other business staff can browse and query the information in the system.
● The system administrator can modify a user's confidentiality level, browsable information categories and similar settings.
B) Workflow management: the system provides a platform that lets information staff manage information through workflows. With the workflow customization function, information staff can build workflows in a visual editor and specify the action of each flow node; each information node can be assigned its own workflow.
Different tasks can be defined as different workflows. A node on a workflow represents a user, an organization or a role; following the defined flow, control passes from node to node from start to end. Node definitions and the workflow layout can be edited in a visual interface.
C) Content management:
i. Set up the system's concrete category tree view.
ii. Collection rules, such as learning-based collection and information filtering, can be set for each category tree.
D) Acquisition management: sets the collection sources, cycles and various other parameters. The main source types are: the Internet, local area networks, scanning of specified directories, mailbox collection, and BBS forums.
i. Internet collection settings: define the URL groups to download, and for each URL group set parameters such as the download interval and the download depth; URLs can be added to, deleted from, or modified in a URL group.
ii. LAN collection settings: define the corporate intranet groups or domains. A computer list can be generated automatically for each group or domain, and the computers to be collected from can be specified. Note: only shared information can be collected.
iii. Specified directory scanning: define scan directory groups; scan directories can be added to, modified in, or deleted from each group.
iv. Mailbox collection: define mailbox groups; the mailboxes to be collected can be added to, modified in, or deleted from each group.
v. BBS forum collection: customize the forums to be collected.
The keyword management module 1033 manages the content of the keyword table and the business rules related to keywords, binds keywords to business rules, and manages keyword sales.
The statistical analysis module 1034 compiles statistics on the information in the integrated information database according to predefined rules. For example, the predefined rules may be:
● Date: counts all articles in the system and the articles in each category, in total and by year and month.
● Source: counts by each source, in total and by year and month.
● Uploader: counts by the person who uploaded the information.
● Editor: counts the amount of information per editor, in total and by year and month.
● Query: ranks information by the number of times it has been queried, in total and by year and month.
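The total/year/month breakdown used by these rules can be sketched with simple counters. The record layout, a (key, date) pair where the key might be a category, source, uploader or editor, and the function name are assumptions; the patent does not describe how the statistics are computed.

```python
from collections import Counter
from datetime import date


def count_by_period(records):
    """Count records per key in total, per year, and per month.

    records: iterable of (key, date) pairs, where key is e.g. a category,
    source, uploader or editor name.
    """
    total, by_year, by_month = Counter(), Counter(), Counter()
    for key, day in records:
        total[key] += 1
        by_year[(key, day.year)] += 1
        by_month[(key, day.year, day.month)] += 1
    return total, by_year, by_month
```

Calling `total.most_common()` on the query counter would give the ranking by query count mentioned in the last rule.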
The user query history module 1035 displays the questions that a given user has asked recently, to help the agent analyze that user's needs. In addition, this module can analyze the questions asked by all users to uncover user needs and popular queries.
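One way to picture the two functions of this module is a bounded per-user history plus a global frequency count. The sketch below is an illustration under stated assumptions: the limit `RECENT_LIMIT`, the whitespace and case normalization, and all function names are hypothetical.

```python
from collections import Counter, defaultdict, deque

RECENT_LIMIT = 3  # hypothetical: number of recent questions kept per user

recent_by_user = defaultdict(lambda: deque(maxlen=RECENT_LIMIT))
all_queries = Counter()


def record_query(user_id: str, question: str) -> None:
    """Store a question in the user's history and in the global count."""
    normalized = " ".join(question.lower().split())
    recent_by_user[user_id].append(normalized)
    all_queries[normalized] += 1


def recent_queries(user_id: str) -> list:
    """Return the user's recent questions, newest first."""
    return list(reversed(recent_by_user[user_id]))


def hot_queries(n: int) -> list:
    """Return the n most frequently asked questions across all users."""
    return [q for q, _ in all_queries.most_common(n)]


record_query("u1", "Hotel near station")
record_query("u2", "hotel  near station")
record_query("u1", "flower delivery")
```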
The keyword retrieval module 1036 performs retrieval by keyword; its specific implementation is shown in Fig. 2. The keyword retrieval module may further include: a service information database 203, which corresponds to the service information database 1021 in Fig. 1; and a full-text database 204, which corresponds to the full-text index database 1023 in Fig. 1. The service information database 203 stores and manages each local network's keyword table, business rules, merchant contracts (merchant ID, keyword ID, weight), and user query behavior records. The full-text database 204 stores each local network's audited merchant information and Internet value-added information, and maintains a unified search keyword table. This search keyword table is not linked to the business rules; it exists only to improve retrieval efficiency.
The keyword retrieval module also includes a business rule engine 1 and a retrieval ordering engine 2. The business rule engine 1 performs standardized word segmentation on the query submitted by the user and combines the result with the business rules stored in the service information database 203 to generate a query condition. The data in the full-text database 204 is then searched with this query condition, and the retrieval ordering engine 2 sorts and displays the search results according to the business rules (for example, ordered by the amount paid to purchase the keyword). The retrieval process is as follows:
1) The operator enters the keyword code;
2) The business logic engine evaluates the input; if there are multiple keywords, the operator selects one;
3) If purchase order information exists for the keyword, the sorted merchant IDs and area information are returned, and the weights of the keyword are revised according to the business rules;
4) The query history record is returned;
5) The retrieval ordering engine returns the specific information of the merchants based on the area information and the merchant IDs;
6) A full-text search is performed with the keyword, and the value-added information is returned.
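The core of steps 3, 5, and 6 can be sketched as ordering merchant contracts by weight and then running a separate full-text lookup. This is a simplified illustration, not the patented implementation: the `Contract` fields beyond (merchant ID, keyword ID, weight), the assumption that weight reflects the purchase amount, and the substring match standing in for a real full-text index are all hypothetical. The weight revision in step 3 and the query history in step 4 are omitted.

```python
from dataclasses import dataclass


@dataclass
class Contract:
    merchant_id: str
    keyword_id: str
    weight: float  # assumed here to reflect the amount paid for the keyword
    area: str      # hypothetical field carrying the area information


def rank_merchants(contracts, keyword_id, area=None):
    """Steps 3 and 5: pick the contracts for a keyword, highest weight first."""
    hits = [c for c in contracts
            if c.keyword_id == keyword_id and (area is None or c.area == area)]
    return sorted(hits, key=lambda c: c.weight, reverse=True)


def full_text_search(documents, keyword):
    """Step 6: naive substring match standing in for a full-text index."""
    return [doc_id for doc_id, text in documents.items()
            if keyword in text.lower()]


def retrieve(contracts, documents, keyword_id, keyword, area=None):
    ranked = rank_merchants(contracts, keyword_id, area)
    return {
        "merchants": [c.merchant_id for c in ranked],
        "value_added": full_text_search(documents, keyword.lower()),
    }


contracts = [
    Contract("m1", "k-flower", 500.0, "north"),
    Contract("m2", "k-flower", 1200.0, "north"),
    Contract("m3", "k-hotel", 800.0, "north"),
]
documents = {
    "d1": "Flower delivery in the city centre",
    "d2": "Hotel booking tips",
}
result = retrieve(contracts, documents, "k-flower", "flower", area="north")
```

Keeping the paid ranking and the full-text lookup as separate functions mirrors the split in the description between merchant results driven by business rules and value-added results driven only by the keyword.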
The data storage server 102, which stores the various data of the system, keeps the business information data of the Best Tone Service in a standard relational database. A data synchronization gateway then replicates the data to the full-text index database (search engine), where a full-text index is built over the data so that agents can query it by keyword.
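The relational-store-plus-index arrangement can be illustrated with an in-memory SQLite table and a simple inverted index. This is a sketch under assumptions: the table name `business_info`, its columns, the sample rows, and the ASCII tokenizer are invented for the example, and a production system would use a real search engine and a word segmenter suited to Chinese text.

```python
import re
import sqlite3
from collections import defaultdict

# Relational side: a stand-in for the service information database.
db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE business_info (id INTEGER PRIMARY KEY, name TEXT, description TEXT)"
)
db.executemany(
    "INSERT INTO business_info (id, name, description) VALUES (?, ?, ?)",
    [
        (1, "Sunny Florist", "Flower delivery and wedding bouquets"),
        (2, "Harbor Hotel", "Hotel rooms near the railway station"),
        (3, "Station Flowers", "Fresh flower stall at the station"),
    ],
)


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def sync_to_index(connection):
    """Play the role of the synchronization gateway: copy rows into an inverted index."""
    index = defaultdict(set)
    rows = connection.execute("SELECT id, name, description FROM business_info")
    for row_id, name, description in rows:
        for token in tokenize(f"{name} {description}"):
            index[token].add(row_id)
    return index


def search(index, query):
    """Return the ids of rows that contain every token of the query."""
    token_sets = [index.get(t, set()) for t in tokenize(query)]
    return sorted(set.intersection(*token_sets)) if token_sets else []


index = sync_to_index(db)
```

Calling `search(index, "station flower")` returns only the rows whose text contains both tokens, which is the kind of keyword lookup the agent would run against the index rather than against the relational tables.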
Referring to Fig. 3, the overall information processing flow of the present invention mainly comprises three major stages (see Fig. 1): information collection, information processing, and information service. It mainly automates and adds intelligence to the collection and organization of Internet information and information from other sources, and provides a platform for information processing, management, and service.
The traditional information entry and query process of the Best Tone Service is as follows: number information delivered through the interface of the IBSS (Integrated Business Support System) enters the manually operated number-line platform of the 114 system. The main task of the number-line platform is to standardize organization names, addresses, and similar fields, and the processed information is imported into the database of the 114 system. After a user successfully calls the 114 system, the agent queries the database fields by keyword according to the user's needs and reports the query results back to the user.
The present method improves and extends the original information entry and query process by dividing it into three stages: information collection, information processing, and information service. The information collection stage handles the information input from the various information sources and places it in the raw information database 301. Information collection includes, for example: storing information entered by employees through web entry in the raw information database 301; automatically collecting, classifying, and deduplicating Internet information and storing it in the raw information database 301; or storing SP/CP information or information from existing databases in the raw information database 301 through the data interface module. Next, the information processing stage edits and processes the information in the raw information database 301 and, after audit, publishes it to the integrated information database 302 for agents to search. In addition, the data in the integrated information database 302 is synchronized to the Best Tone Service service platform database 303, to be shared by every local network in the province. Finally, the information service stage provides users and service representatives with correct information according to user needs, for example through automatic publishing, multi-path retrieval, and customized information push.
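The collection and processing stages can be sketched as a small pipeline: records are deduplicated on the way into the raw store and published to the integrated store only after passing an audit. Everything here is illustrative: the class and field names, the SHA-256 digest of normalized text used for deduplication, and the audit callback are assumptions, since the patent does not state how deduplication or audit is implemented.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Record:
    source: str  # e.g. "web_entry", "internet", "sp_cp" (hypothetical labels)
    text: str
    audited: bool = False


@dataclass
class Pipeline:
    raw: list = field(default_factory=list)         # stands in for raw information database 301
    integrated: list = field(default_factory=list)  # stands in for integrated information database 302
    _seen: set = field(default_factory=set)

    def collect(self, record: Record) -> bool:
        """Store a record in the raw store unless its normalized text is a duplicate."""
        normalized = " ".join(record.text.lower().split())
        digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        if digest in self._seen:
            return False
        self._seen.add(digest)
        self.raw.append(record)
        return True

    def audit_and_publish(self, approve) -> int:
        """Publish raw records that pass approve(record); return how many were published."""
        published = 0
        for record in self.raw:
            if not record.audited and approve(record):
                record.audited = True
                self.integrated.append(record)
                published += 1
        return published


pipeline = Pipeline()
first = pipeline.collect(Record("web_entry", "Harbor Hotel, 1 Station Road"))
duplicate = pipeline.collect(Record("internet", "harbor hotel,  1 station road"))
empty = pipeline.collect(Record("sp_cp", ""))
published = pipeline.audit_and_publish(lambda r: bool(r.text.strip()))
```

Exact-match hashing only catches duplicates that differ in case or spacing; near-duplicate detection would need a similarity measure, which this sketch does not attempt.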
According to one embodiment of the present invention, the data in the raw information database 301 and the integrated information database 302 of Fig. 3 is preferably kept in the service information database 1021 of the data storage server 102 in Fig. 1 (in different tables), and the data in the Best Tone Service service platform database 303 of Fig. 3 is kept in the full-text index database 1023 of the data storage server 102 in Fig. 1.
The information collection and query system for the Best Tone Service according to the present invention has been described above with reference to the drawings, but the present invention is not limited thereto. Those skilled in the art will appreciate that various modifications and improvements can be made to the present invention in accordance with its principles without departing from the scope of the appended claims.
Claims (9)
1. An information collection and query system for the Best Tone Service, the system comprising at least an information collection server, a data storage server, and an information management and operation server, wherein:
The information collection server is used to process the information input from various information sources, and is connected to the information management and operation server through an IP network using the Transmission Control Protocol (TCP);
The information management and operation server is used to manage information, the keyword table, and the business rules and to provide an information query service, and is connected to the data storage server and the information collection server through the IP network using TCP;
The data storage server is used to store the various data of the system;
The data storage server further comprises:
A service information database, which is a standard relational database, used to store all data required by the information platform service, the keyword table, and the business rules, and to implement the related search functions;
A data conversion and synchronization gateway, used to synchronously replicate the structured and unstructured content data of the service information database to the full-text index database; and
A full-text index database, used to build a full-text index over the synchronized structured and unstructured content data and to provide a keyword retrieval interface; and
The system, through the information collection server, the data storage server, and the information management and operation server, uniformly collects and processes a plurality of information sources, performs a joint search over structured and unstructured information, and ranks and displays the search results according to the business rules, so as to carry out information collection and query according to the user's query needs.
2. The information collection and query system as claimed in claim 1, wherein:
The information management and operation server can be connected to a plurality of information collection servers, and sends the configuration parameters related to information collection to the information collection servers over TCP;
The information collection server further comprises:
An Internet information collection module, which is configured according to the parameters received by the information collection server, and is used to crawl the content of websites on the Internet according to the configured parameters and to send the data to the information management and operation server over TCP;
An Integrated Business Support System (IBSS) number information change processing module, used to process the changed number, organization, and address information delivered from the IBSS, and to format this information;
A form input module, used to process manually entered structured information; this module can define the input fields according to industry and checks the format of the entered content;
A task input module, used to enter the specific content of the information search tasks of the information collection personnel; and
A service provider/content provider information input interface module, used to process the information delivered from the systems of service providers/content providers and to convert the data format into XML.
3. The information collection and query system as claimed in claim 2, wherein:
The parameters used by the Internet information collection module comprise: the uniform resource locator (URL), the collection time, the search depth, the search scope, the website login parameters, and the information classification keywords; and
The Internet information collection module is further used to dynamically collect, in real time, information from the websites of various industries on the Internet, while the remaining modules add supplementary information sources, so that the original number information is expanded and supplemented with related value-added information to build the Best Tone Service service information database.
4. The information collection and query system as claimed in claim 1, wherein the information management and operation server further comprises:
An information collection module, used to process the information sent by the information collection server;
A system management module, used to configure the management of system access rights, of the information processing flow, and of the information content;
A keyword management module, used to manage the contents of the keyword table and the business rules associated with keywords, to bind keywords to business rules, and to manage keyword sales;
A statistical analysis module, used to compile statistics on the information in the integrated information database according to predefined rules;
A user query history module, used to display the questions that a given user has asked recently, to help the agent analyze that user's needs; this module is also used to analyze the questions asked by all users to uncover user needs and popular queries; and
A keyword retrieval module, used to perform retrieval by keyword.
5. The information collection and query system as claimed in claim 4, wherein:
The information collection module introduces an information credibility model, which evaluates the credibility of a piece of information according to parameters including the importance of the industry to which the information belongs, its publication time, the website to which it belongs, and the number of links of the web page, and processes information with high credibility first.
6. The information collection and query system as claimed in claim 5, wherein:
The information credibility model is a processing module that evaluates the reliability of information; it is a pre-processing module of the information collection module, used to evaluate and score the reliability of the information entering the information collection module, so that the information collection personnel can conveniently give priority to important information.
7. The information collection and query system as claimed in claim 5, wherein the processing of information by the information collection module comprises:
Automated intelligent processing of information, including automatic classification, automatic deduplication, and field analysis; and
Manual processing of information.
8. The information collection and query system as claimed in claim 4, wherein the keyword retrieval module further comprises:
A second service information database, corresponding to the service information database in the system, used to store and manage each local network's keyword table, business rules, merchant contracts, and user query behavior records, wherein the merchant contract comprises a merchant ID, a keyword, and a weight; and
A full-text database, corresponding to the full-text index database in the system, used to store each local network's audited merchant information and Internet value-added information.
9. The information collection and query system as claimed in claim 4, wherein the keyword retrieval module further comprises:
A business rule engine module, used to perform standardized word segmentation on the query submitted by the user, to combine the result with the business rules stored in the service information database to generate a query condition, and then to search the data in the full-text database with this query condition; and
A retrieval ordering engine module, used to sort and display the search results according to the business rules.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2006101542065A CN101146152B (en) | 2006-09-14 | 2006-09-14 | Information collection and search system for telecommunication information station |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101146152A CN101146152A (en) | 2008-03-19 |
CN101146152B true CN101146152B (en) | 2010-10-20 |
Family
ID=39208429
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2006101542065A Active CN101146152B (en) | 2006-09-14 | 2006-09-14 | Information collection and search system for telecommunication information station |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101146152B (en) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8055638B2 (en) * | 2008-12-11 | 2011-11-08 | Microsoft Corporation | Providing recent history with search results |
CN101710927B (en) * | 2009-11-23 | 2013-04-10 | 中国电信股份有限公司 | Method and system for providing information service based on voice platform |
CN102279849A (en) * | 2010-06-09 | 2011-12-14 | 中兴通讯股份有限公司 | Method and system for big data query |
CN102184253A (en) * | 2011-05-30 | 2011-09-14 | 北京搜狗科技发展有限公司 | Method and system used for pushing grabbed and updated messages of network resource |
CN103455605B (en) * | 2013-09-04 | 2016-06-01 | 电子科技大学 | A kind of Intranet environment file depth search method |
CN104735097A (en) * | 2013-12-18 | 2015-06-24 | 青岛海尔空调器有限总公司 | Information collecting method and system |
CN104615696B (en) * | 2015-01-23 | 2018-05-01 | 国家电网公司 | A kind of 95598 knowledge base system and building method |
CN104699777B (en) * | 2015-03-10 | 2019-06-11 | 中国联合网络通信集团有限公司 | The correlating method and system of big data analysis excavation chain of command and service surface |
CN105718605A (en) * | 2016-05-02 | 2016-06-29 | 杨鹏 | Information closed operation system and operation method thereof |
CN105930524A (en) * | 2016-05-28 | 2016-09-07 | 徐志勇 | Big data aggregation method facing quick service |
CN106294847A (en) * | 2016-08-22 | 2017-01-04 | 成都天地网络科技有限公司 | Business operation system based on data mining |
CN107484189B (en) * | 2017-07-27 | 2020-10-16 | 北京市天元网络技术股份有限公司 | LTE data processing system |
CN109376191A (en) * | 2018-09-18 | 2019-02-22 | 深圳壹账通智能科技有限公司 | Financial report data processing method, device, computer equipment and storage medium |
CN110377729A (en) * | 2019-06-11 | 2019-10-25 | 福建奇点时空数字科技有限公司 | A kind of group based on community network model builds mobile body similarity calculating method |
CN111368092B (en) * | 2020-02-21 | 2020-12-04 | 中国科学院电子学研究所苏州研究院 | Knowledge graph construction method based on trusted webpage resources |
CN112052283A (en) * | 2020-08-07 | 2020-12-08 | 上海刀奇智能科技有限公司 | Information consultation service platform based on big data analysis and collection |
CN115687427A (en) * | 2022-11-25 | 2023-02-03 | 贵州电网有限责任公司 | Big data-based information service system and method |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1157958A (en) * | 1996-12-06 | 1997-08-27 | 张熹 | Information service system for networking with international interconnected network |
CN1288322A (en) * | 1999-09-15 | 2001-03-21 | 深圳市华为技术有限公司 | Method of realizing information station switching on management business on intelligent net |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |