CN101158949A - Method and system for file items ranking and searching based on aggregation - Google Patents
Method and system for file items ranking and searching based on aggregation
- Publication number
- CN101158949A (application CNA2007100092357, CN200710009235A)
- Authority
- CN
- China
- Prior art keywords
- docuterm
- file
- attribute
- project
- entity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
A file item is the unique identifier of a file within the file system where the file resides. The invention uses sets to classify and retrieve file items. The system structure is shown in Figure 3. In classification, the user specifies the membership relations between a file item and a number of entity file item sets in the system, and the system's "entity file item set storer" stores these assignments in the "entity file item set database". In retrieval, the user specifies the combination pattern of the result file item set to be retrieved. The "search interface" converts this pattern into an expression and passes it to the "expression parser", which analyzes it to obtain the entity file item sets that the expression needs and the operations among them. The "entity file item set extractor" extracts the required file item sets and passes them, together with the operations, to the "set operator", which performs the set operations to obtain the result file item set. The "search interface" then presents the result to the user.
Description
Technical field
The invention relates to the field of computer file systems.
Background technology
Modern electronic computer systems generally store persistent information by means of a system-managed file system. Regardless of the medium on which a file system is built or the storage organization it uses, from the user's logical point of view it almost always adopts a tree structure. A file is an independent information storage entity located at a leaf node of this tree; non-leaf nodes are called directories, and each file is uniquely identified by the path from the root node of the tree to the file's node. This is the concept of file and file system in the ordinary sense of electronic computer systems, referred to here as the file and file system in the narrow sense.
Further abstraction yields the concepts of file and file system in a broad sense, referred to as the generalized file and the generalized file system. A generalized file is defined as an independent information storage entity. A generalized file system is defined as a composite structure made up of multiple generalized files, each of which has a unique identifier by which it can be identified and referenced. This definition drops the ties to the operating system and to the tree-shaped organization found in the original definition, so that many information organization structures that are not conventional file systems also satisfy it.
For example, in the product catalog system of a shopping website, the web page of each product can be regarded as a generalized file and its URL as the unique identifier, so the product catalog system can be considered a generalized file system. As another example, independent blocks of memory can each be identified and referenced by a unique address, so these memory blocks can also be regarded as a generalized file system.
However, too many systems satisfy the definition of a generalized file system, and they differ too much from one another to be described and handled uniformly. It is therefore stated here that the file system referred to in the present invention explicitly covers two cases. The first is a file system managed by an operating system running on a computer system. The second is a hierarchical directory system implemented on a website accessed through a WWW browser, for example the multi-level product catalog that a shopping website forms by dividing products step by step by category. The second kind closely resembles a file system managed by an operating system: it is also organized logically as a tree, and the URL of a web page closely resembles a file path. Even when no interface displays this tree structure directly, its logical essence is a tree.
The computer system to which the invention applies is defined as a machine that has one or more CPUs and a unified memory and can execute programs. Real systems covered by this definition include PCs; large, medium and small computer systems; and mobile computing devices such as PDAs and mobile phones.
The file systems defined above are generally organized as trees. Although the tree structure is widely used, it has several important shortcomings, summarized as follows:
(1) A file must be placed under the directory that corresponds to its category, but a file can belong to several categories. To keep a file's unique identifier in the file system, namely its path, unique, the tree structure used by current file systems generally allows a file to be placed under only one directory, which conflicts with the file belonging to several categories. For example, suppose personnel information files are to be stored by nationality: a subdirectory is created for each country (China, U.S., Russia, Britain, Japan, and so on), and the personnel information file of each person holding that nationality is placed under it. The trouble is that a person can hold more than one nationality, for instance both Swiss and U.S. nationality. Under the existing file system approach, this person's file should be in both the Switzerland subdirectory and the U.S. subdirectory, which current file systems do not allow. A further problem arises if one wants to know which people hold multiple nationalities, for example both Canadian and U.S. nationality, or Russian, British and Swiss nationality at once: on the traditional tree structure this is clearly very cumbersome.
(2) Another problem of directory tree systems is the branch explosion caused by combining classification criteria. Again take a file system that stores personnel information files. Suppose it is first divided into subdirectories by the provincial-level administrative division where the person is located; under each of these, two subdirectories, male and female, are created; and under each of those, five subdirectories by age: children, teenager, youth, middle age and old age. The resulting directory structure is shown in Fig. 1. The number of leaf directories under this division is 34 × 2 × 5 = 340, and every further level of division multiplies it again. In general it is given by the following formula:

$$S_F = S_1 \times S_2 \times \cdots \times S_n$$

Here the file system $F$ has $n$ levels, $S_F$ is the number of leaf directories in the whole file system, and $S_i$ is the number of subdirectories at level $i$. A further problem is that, to obtain the files of young people by province, one runs into the second-level division by sex. All the files of young people in a province cannot be obtained in one step; the same province directory must be entered twice, once to obtain the files of all young men in the province and once to obtain the files of all young women.
(3) The classification scheme is fixed and lacks flexibility. Continuing the example above, suppose personnel information files must additionally be classified by education, which requires subdirectories such as primary school, middle school, university and postgraduate. First, it is hard to decide under which level of directory these subdirectories should go. Second, whichever level is chosen, a number of subdirectories must be created and deleted, and the personnel information files must be moved into the corresponding subdirectories.
(4) Files cannot be classified and retrieved by attribute. Continuing the example above, to find the files of people aged between 10 and 15, the existing file system architecture offers no way to do this without reading the information inside the files.
Summary of the invention
The two most basic concepts of the invention are the file item (docuterm) and the file item set. A file item is defined as the unique identifier of a generalized file in a generalized file system: in a file system managed by an operating system it is the full path of a file, and in a website directory system it is the URL of a web page. A file item set is a set, in the mathematical sense, made up of file items. If file item a is a member of file item set A, a is said to belong to A, written a ∈ A; conversely, if a belongs to A, then a is one of the members of A. A file item occurs at most once in any one file item set, but it can belong to several different file item sets at the same time.
Through its file item, a file in the file system can be read, written and otherwise accessed without regard to where the file is physically stored, whether on a network server or on a local disk. Likewise, assigning a file item to one file item set or another has no effect on the file itself. It is precisely because file items are relatively independent of the file entities that sets can be used to classify and retrieve file items.
Take the example from the previous section of classifying personnel information files by nationality. The set-based solution is first to create a series of file item sets representing countries: "China", "U.S.", "Russia", "Britain", "Japan", and so on. Each personnel file item is then added to the file item sets that correspond to the nationalities held by the person it records. If the person has Chinese nationality, the file item is added to "China"; if the person has U.S. nationality, it is added to "U.S."; and if the person holds both Chinese and U.S. nationality, it is added to both "China" and "U.S.".
On this basis, set operations can be used to retrieve the personnel information files of the desired nationalities, with a set operation expression stating the retrieval requirement. To retrieve the personnel file items with Chinese nationality, the expression is: "China". To retrieve those that have both Swiss and U.S. nationality, the expression is: "Switzerland" ∩ "U.S.". To retrieve those that have Swiss or U.S. nationality, the expression is: "Switzerland" ∪ "U.S.".
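The nationality example can be sketched with Python's built-in `set` type. This is only an illustration under stated assumptions: the file paths, the `sets` dictionary and the `classify` helper are hypothetical names, not part of the patent text.

```python
# Hypothetical file items: full paths used as unique identifiers.
alice = "/personnel/alice.txt"   # Chinese nationality
bob = "/personnel/bob.txt"       # Swiss and U.S. nationality
carol = "/personnel/carol.txt"   # U.S. nationality

# One file item set per country, keyed by its unique name.
sets = {"China": set(), "Switzerland": set(), "U.S.": set()}

def classify(item, set_names):
    """Add a file item to every named file item set it belongs to."""
    for name in set_names:
        sets[name].add(item)

classify(alice, ["China"])
classify(bob, ["Switzerland", "U.S."])   # one item, two sets
classify(carol, ["U.S."])

chinese = sets["China"]                              # "China"
swiss_and_us = sets["Switzerland"] & sets["U.S."]    # "Switzerland" ∩ "U.S."
swiss_or_us = sets["Switzerland"] | sets["U.S."]     # "Switzerland" ∪ "U.S."
```

The files themselves are never moved or copied; only their identifiers are placed in sets, which is what lets one file item appear under several countries at once.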
The fundamental operations on sets are:
Identity: the input is a set A, and the output is A itself.
Intersection: the inputs are sets A and B, and the output is C = {a | a ∈ A and a ∈ B}. The operation is written A ∩ B = C, and C is called the intersection of A and B.
Union: the inputs are sets A and B, and the output is C = {a | a ∈ A or a ∈ B}. The operation is written A ∪ B = C, and C is called the union of A and B.
Difference: the inputs are sets A and B, and the output is C = {a | a ∈ A and a ∉ B}. The operation is written A - B = C, and C is called the difference of A and B.
All set operations involved in the invention can be constructed by combining the basic set operations above.
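As a small illustration of building a further operation from the basic ones, the sketch below defines the four operations as plain functions and composes a symmetric difference from them. The symmetric difference is chosen here only as an example; the text does not name it, and the function names are assumptions.

```python
def identity(a):
    """Identity: return the input set unchanged (as a copy)."""
    return set(a)

def intersection(a, b):
    """C = {x | x in A and x in B}."""
    return {x for x in a if x in b}

def union(a, b):
    """C = {x | x in A or x in B}."""
    return set(a) | set(b)

def difference(a, b):
    """C = {x | x in A and x not in B}."""
    return {x for x in a if x not in b}

def symmetric_difference(a, b):
    """Items in exactly one of A and B, built only from the basic operations."""
    return union(difference(a, b), difference(b, a))

A = {"/p/1", "/p/2"}
B = {"/p/2", "/p/3"}
only_one = symmetric_difference(A, B)
```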
Another key concept of the invention is the attribute. An attribute records characteristic information about some aspect of a file item and can be assigned by a person. For example, the personnel file items used in the Background section can be given attributes as needed, such as an age attribute whose value is the age of the person the file describes.
A file item can be given several different attributes and corresponding attribute values as needed. Likewise, a file item set can be given several different attributes and corresponding attribute value ranges. Referring to the scenario in shortcoming (2) of the tree structure in the Background section, create a file item set representing old age, give it an age attribute, and set the range of that attribute to greater than or equal to 50. Any file item added to this set must then have the age attribute, and its value must fall within the set's range for age, that is, be greater than or equal to 50. A file item may have attributes that the set it belongs to does not have; for example, a personnel file item belonging to the old age set may have a sex attribute with a corresponding value.
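One way to picture the constraint between an item's attribute values and a set's attribute ranges is the minimal sketch below. The class names, the half-open `(low, high)` range encoding and the sample paths are assumptions made for illustration only.

```python
import math
from dataclasses import dataclass, field

@dataclass
class FileItem:
    ident: str                                      # path or URL
    attributes: dict = field(default_factory=dict)  # attribute name -> value

@dataclass
class FileItemSet:
    name: str
    ranges: dict = field(default_factory=dict)      # attribute -> (low, high), low <= v < high
    items: dict = field(default_factory=dict)       # ident -> FileItem

    def accepts(self, item):
        """An item must carry every attribute of the set, with a value in range."""
        for attr, (low, high) in self.ranges.items():
            value = item.attributes.get(attr)
            if value is None or not (low <= value < high):
                return False
        return True

    def add(self, item):
        if not self.accepts(item):
            raise ValueError(f"{item.ident} does not satisfy the ranges of {self.name!r}")
        self.items[item.ident] = item

old_age = FileItemSet("old age", ranges={"age": (50, math.inf)})
retiree = FileItem("/personnel/0007", {"age": 67, "sex": "female"})  # extra attribute is fine
engineer = FileItem("/personnel/0008", {"age": 32})

old_age.add(retiree)
```

Calling `old_age.add(engineer)` would raise `ValueError`, because 32 lies outside the range set for "old age".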
To address shortcoming (2) of the tree structure of existing file systems described in the previous section, the following measures can be taken:
Create file item sets by provincial-level administrative region: "Beijing", "Shanghai City", "Henan Province", "Hebei Province", and so on. Create two file item sets by sex, "male" and "female". Create file item sets by age: "children", "teenager", "youth", "middle age" and "old age". Each personnel file item is added to the set for its provincial-level region, to the set for its sex, and to the set for its age group. At retrieval time these sets are combined dynamically with set operations as needed. For example, to retrieve the personnel information files of all elderly people in Beijing, it suffices to take the intersection of "Beijing" and "old age"; the retrieval expression is: "Beijing" ∩ "old age". The number of file item sets required grows as a sum, 34 + 2 + 5 = 41, a huge reduction compared with the product that gives the number of leaf directories in a traditional file system.
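The difference between the two growth rates can be checked with a few lines of arithmetic. The counts 34, 2 and 5 come from the text; the fourth criterion (four education levels) is taken from the education example and is added here only to show how each count changes when a criterion is added.

```python
import math

# Number of categories per classification criterion, as given in the text.
criteria = {"province": 34, "sex": 2, "age group": 5}

leaf_directories = math.prod(criteria.values())   # tree: 34 * 2 * 5
file_item_sets = sum(criteria.values())           # sets: 34 + 2 + 5

# Adding a further criterion (primary school, middle school, university, postgraduate).
criteria["education"] = 4
leaf_directories_with_education = math.prod(criteria.values())
file_item_sets_with_education = sum(criteria.values())
```

With the extra criterion the tree needs four times as many leaf directories, while the set-based scheme needs only four more sets.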
To address shortcoming (3) of the tree structure of existing file systems described in the previous section, the following measure can be taken:
On top of the set division above, add the file item sets "primary school", "middle school", "university" and "postgraduate", and add each personnel file item to the set that corresponds to the person's education.
Description of drawings
Fig. 1 shows the branch explosion caused by combining classification criteria in the tree structure of an existing personnel information file system;
Fig. 2 shows the parent-child relationships among the personnel file item sets;
Fig. 3 is a logical block diagram of a system that uses the method.
Embodiment
To simplify the description, the word "system" below refers to a system that uses the method of the invention. Such a system has two kinds of file item set. The first is the entity file item set, which is made up of concrete file items; for it the system records the file items that make it up, together with the attributes and corresponding attribute value ranges involved. The second is the view file item set, which is formed by combining other file item sets, both entity and view file item sets, by set operations, by specifying attribute value ranges, or by both; the system records only the operation expression of such a set. Every file item set, entity or view, is identified and referenced by a unique name, so that specifying sets by name when composing them with set operations is free of ambiguity and conflict.
Entity file item sets can stand in a parent-child relationship. If entity file item set A is designated a subset of entity file item set B, then every file item that belongs to A also belongs to B, and A inherits all attributes of B; the attribute value ranges that A sets must lie within the corresponding ranges of B. For example, create an "age" entity file item set, give it an age attribute and set its range to 0 ≤ age. Then create an "old age" entity file item set and designate it a subset of "age". "old age" inherits the age attribute, and its range is further set to 50 ≤ age, which lies within the range of "age". Every file item belonging to "old age" has the age attribute with a value in the range set for "old age", and by the definition of the subset relation it also belongs to "age". If A is a subset of B, then B is called a superset of A, and A and B are said to be in a parent-child relationship. The relationship is transitive: if A is a subset of B and B is a subset of C, then A is also a subset of C. The relationship between A and B is called a direct parent-child relationship, and the relationship between A and C is called an indirect parent-child relationship.
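The inheritance and range-containment rules can be sketched as follows. This is a minimal illustration, assuming each entity set has at most one direct parent and that ranges are half-open `(low, high)` pairs; neither assumption is stated in the text, and the "over 80" set is a hypothetical addition used to show transitivity.

```python
import math

def within(inner, outer):
    """True if the range inner = (low, high) lies inside the range outer."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

class EntitySet:
    def __init__(self, name, parent=None, ranges=None):
        self.name = name
        self.parent = parent
        # Start from the parent's attributes and ranges (inheritance).
        self.ranges = dict(parent.ranges) if parent else {}
        # Narrow inherited ranges or add new attributes.
        for attr, new_range in (ranges or {}).items():
            inherited = self.ranges.get(attr)
            if inherited is not None and not within(new_range, inherited):
                raise ValueError(f"{name}: range of {attr!r} exceeds the parent's range")
            self.ranges[attr] = new_range

    def supersets(self):
        """Names of all direct and indirect parents, nearest first."""
        node, found = self.parent, []
        while node is not None:
            found.append(node.name)
            node = node.parent
        return found

age = EntitySet("age", ranges={"age": (0, math.inf)})
old_age = EntitySet("old age", parent=age, ranges={"age": (50, math.inf)})
over_80 = EntitySet("over 80", parent=old_age, ranges={"age": (80, math.inf)})  # hypothetical
```

Here "over 80" is a direct child of "old age" and an indirect child of "age", which `over_80.supersets()` reports in that order.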
The file items that belong to an entity file item set are of two kinds: those that belong to the set directly, and all those that belong to any of its subsets, direct or indirect. When recording the composition of an entity file item set, the system records only the file items that belong to it directly; the file items of its subsets can be computed from the subset relations. In summary, the information the system records for an entity file item set comprises: the file items that belong to it directly, with their attributes and attribute values; the attributes of the set and their ranges; and the direct parent-child relationships between this set and other entity file item sets.
To show clearly how the system works, the personnel information file example is used again. Suppose the files are to be classified by the location, sex, age and education of the person each file describes. First a series of entity file item sets is created, with the parent-child relationships shown in Fig. 2.
The classification process is as follows. According to the location, sex, age and education of the person that a given personnel file item describes, the user marks the file item as belonging to the corresponding file item sets, and the system records these assignments. For instance, for a particular personnel file item the user knows that the person recorded is from Zhangzhou City, Fujian Province, is male, is 32 years old and so belongs to youth, and is a master's student. Based on this information the file item is added to the file item sets "Zhangzhou City", "male", "youth" and "master's student". Because of the parent-child relationships between sets, the file item is in effect also added to all ancestor sets of these sets. For example, "Fujian Province" is the parent of "Zhangzhou City", so adding the file item to "Zhangzhou City" amounts to adding it to "Fujian Province" as well. Because the relationship is transitive, the file item is in effect added to every direct or indirect ancestor of "Zhangzhou City". The system, however, records only that the file item belongs to "Zhangzhou City"; its membership in the direct and indirect ancestors can be inferred from the parent-child relationships between those sets and "Zhangzhou City".
The retrieval process is as follows. The user submits to the system the set operation pattern of the result file item set, as a set operation expression or in some other form. The system looks up the composition of the entity file item sets that the pattern involves, performs the set operations to obtain the result file item set, and returns it to the user, who can then access the files through the paths or URLs recorded in its file items. Building on the example above, suppose the user needs the personnel file items of everyone in Fujian Province with a postgraduate education. The expression submitted to the system is: "Fujian Province" ∩ "postgraduate". The system first computes the composition of "Fujian Province": its members are the file items recorded as belonging to it directly, plus the file items that belong to its descendant sets, such as "Zhangzhou City" and "Xiamen City" at the next level. The system computes "postgraduate" in the same way, and then intersects the two sets to obtain the final result file item set.
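The retrieval just described can be sketched as follows: only direct membership is stored, and the full membership of a set is derived by walking down to its subsets. The dictionaries, sample paths and the `members` function are hypothetical, and each set is assumed to have a single direct parent.

```python
# Direct parent of each entity set (None for top-level sets); sample data only.
parent = {
    "Fujian Province": None,
    "Zhangzhou City": "Fujian Province",
    "Xiamen City": "Fujian Province",
    "postgraduate": None,
    "master's student": "postgraduate",
}

# The system records only direct membership.
direct_items = {
    "Fujian Province": set(),
    "Zhangzhou City": {"/personnel/0001"},
    "Xiamen City": {"/personnel/0002"},
    "postgraduate": {"/personnel/0002"},
    "master's student": {"/personnel/0001", "/personnel/0003"},
}

def members(name):
    """Direct items of a set plus the items of all its direct and indirect subsets."""
    result = set(direct_items[name])
    for child, child_parent in parent.items():
        if child_parent == name:
            result |= members(child)
    return result

# "Fujian Province" ∩ "postgraduate"
result = members("Fujian Province") & members("postgraduate")
```

Item `/personnel/0001` was recorded only under "Zhangzhou City" and "master's student", yet it appears in the result because both memberships are inferred through the parent-child relationships.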
It is worth noting that the most direct and basic way for the user to specify the result file item set during retrieval is a set operation expression. Whatever operator symbols and syntax the expression uses, for example reverse Polish notation, the set operations it expresses are the same. Other means, such as a friendly graphical interface that takes the user's input to specify the operations among sets, can all be regarded as equivalent to a set operation expression.
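To illustrate that the surface syntax does not change the underlying set operations, here is a minimal sketch of an evaluator for expressions written in reverse Polish notation. The token format, operator symbols and sample data are assumptions for the example.

```python
OPERATORS = {
    "∩": lambda a, b: a & b,
    "∪": lambda a, b: a | b,
    "-": lambda a, b: a - b,
}

def eval_rpn(tokens, sets):
    """Evaluate a set expression given as a list of tokens in reverse Polish notation."""
    stack = []
    for token in tokens:
        if token in OPERATORS:
            if len(stack) < 2:
                raise ValueError("operator is missing an operand")
            right = stack.pop()
            left = stack.pop()
            stack.append(OPERATORS[token](left, right))
        else:
            stack.append(sets[token])       # operand: look up the set by its unique name
    if len(stack) != 1:
        raise ValueError("malformed expression")
    return stack[0]

sets = {
    "Fujian Province": {"/p/a", "/p/b", "/p/c"},
    "postgraduate": {"/p/b", "/p/c", "/p/d"},
    "male": {"/p/c", "/p/e"},
}

# Infix form: ("Fujian Province" ∩ "postgraduate") - "male"
rpn_result = eval_rpn(["Fujian Province", "postgraduate", "∩", "male", "-"], sets)
infix_result = (sets["Fujian Province"] & sets["postgraduate"]) - sets["male"]
```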
The classification and retrieval described so far do not involve attributes. The following describes classification and retrieval with attributes, taking the age attribute as the example. First, the "age" file item set is given an age attribute whose range is set to greater than or equal to 0. By the parent-child relationships among entity file item sets and the constraints between attributes described above, the next-level sets such as "children" and "teenager" inherit the age attribute, and the user sets the age range of each according to the actual age range it stands for. Here the range is set to 0 ≤ age < 12 for "children", 12 ≤ age < 18 for "teenager", 18 ≤ age < 38 for "youth", 38 ≤ age < 50 for "middle age", and 50 ≤ age for "old age".
Classification with attributes proceeds as follows. For a particular personnel file item, the user knows that the person recorded is from Zhangzhou City, Fujian Province, is male, is 32 years old and is a master's student. Region, sex and education are classified as in the example above. For age, the user first specifies the higher-level file item set that has the age attribute, namely "age", and then specifies the file item's age value, 32. Based on this value, the system adds the file item to the next-level set of "age" whose age range contains the value; here only "youth", whose range is 18 ≤ age < 38, qualifies. Note that this process is recursive: if "youth" were further subdivided by age into a set for 18 ≤ age < 25 and a set for 25 ≤ age < 38, the file item would end up added directly to the set for 25 ≤ age < 38.
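The recursive placement can be sketched as below, using the age ranges from the text. The `AgeSet` class, the function name and the names of the two "youth" subdivisions are hypothetical; the sketch assumes sibling ranges do not overlap.

```python
import math

class AgeSet:
    def __init__(self, name, low, high, children=()):
        self.name, self.low, self.high = name, low, high   # range: low <= age < high
        self.children = list(children)
        self.items = set()                                  # directly recorded file items

def classify_by_age(node, item, age):
    """Place the item in the deepest set whose range contains the age; return its name."""
    if not (node.low <= age < node.high):
        raise ValueError(f"age {age} is outside the range of {node.name!r}")
    for child in node.children:
        if child.low <= age < child.high:
            return classify_by_age(child, item, age)        # descend one level
    node.items.add(item)                                    # no narrower set fits
    return node.name

youth_18_25 = AgeSet("youth 18-25", 18, 25)
youth_25_38 = AgeSet("youth 25-38", 25, 38)
youth = AgeSet("youth", 18, 38, [youth_18_25, youth_25_38])
middle_age = AgeSet("middle age", 38, 50)
age = AgeSet("age", 0, math.inf, [
    AgeSet("children", 0, 12),
    AgeSet("teenager", 12, 18),
    youth,
    middle_age,
    AgeSet("old age", 50, math.inf),
])

placed = classify_by_age(age, "/personnel/0001", 32)
```

The user only names the top-level "age" set and the value 32; the descent to the narrowest matching set happens automatically.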
Retrieval with attributes proceeds as follows. Suppose the user wants the personnel information files of men from Fujian Province aged 18 to 40. The expression describing the retrieval requirement is: "Fujian Province" ∩ ("age" 18 ≤ age ≤ 40). After receiving the request, the system first computes the "Fujian Province" file item set, then computes the "age" file item set and filters out the file items in it that do not satisfy the specified age range, and finally intersects the filtered set with "Fujian Province" to obtain the final result file item set.
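The three steps (compute each set, filter by attribute range, intersect) can be sketched as follows. The sketch follows the expression as written, which filters on region and age only; the sample data and the `filter_by_range` helper are assumptions.

```python
# Age attribute values recorded for each file item (sample data).
ages = {"/p/1": 32, "/p/2": 45, "/p/3": 18, "/p/4": 25}

fujian = {"/p/1", "/p/2", "/p/3"}   # members of "Fujian Province"
age_set = set(ages)                 # members of "age": every item with an age value

def filter_by_range(items, values, low, high):
    """Keep the items whose attribute value satisfies low <= value <= high."""
    return {item for item in items if low <= values[item] <= high}

# "Fujian Province" ∩ ("age" 18 ≤ age ≤ 40)
in_range = filter_by_range(age_set, ages, 18, 40)
result = fujian & in_range
```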
View file item sets serve to simplify user input. When the user often retrieves with a particular combination pattern of file item sets, the expression of that pattern can be associated with a unique name and recorded by the system, creating a view file item set. At retrieval time the user submits the name of the view file item set, and the system converts it into the corresponding combination pattern and computes the result file item set. For example, to retrieve frequently the personnel information files of young people with a university education, create a view file item set named "youth_university" whose associated expression is: "youth" ∩ "university". A view file item set can itself take part in further set combinations as an operand. For example, to retrieve the personnel information files of young people in Fujian Province with a university education, the following expression can be used: "youth_university" ∩ "Fujian Province".
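A view set can be pictured as a stored expression that is expanded on demand. In the sketch below, expressions are nested tuples of the form `(operator, left, right)`; that encoding, the dictionaries and the sample data are assumptions made for illustration.

```python
entity_sets = {
    "youth": {"/p/1", "/p/2", "/p/3"},
    "university": {"/p/2", "/p/3", "/p/4"},
    "Fujian Province": {"/p/3", "/p/5"},
}

# A view records only its expression, never its members.
views = {"youth_university": ("∩", "youth", "university")}

def evaluate(expr):
    """Evaluate a set name or an (operator, left, right) expression."""
    if isinstance(expr, str):
        if expr in views:
            return evaluate(views[expr])     # expand the view into its expression
        return entity_sets[expr]
    op, left, right = expr
    a, b = evaluate(left), evaluate(right)
    if op == "∩":
        return a & b
    if op == "∪":
        return a | b
    if op == "-":
        return a - b
    raise ValueError(f"unknown operator {op!r}")

# "youth_university" ∩ "Fujian Province"
result = evaluate(("∩", "youth_university", "Fujian Province"))
```

Because the view is re-evaluated each time, its result reflects the current contents of "youth" and "university" without any extra bookkeeping.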
The system can be implemented in various ways, depending on the platform and technology used. One feasible implementation is described below; its main components are shown in Fig. 3. The components and their functions are introduced by following the system's workflows.
First, the classification flow. It begins when the user, through the classification interface, supplies a group of file items to classify and the membership relations between these file items and the system's entity file item sets. Membership can be established in three ways. The first is to mark a file item directly as belonging to a file item set. The second is to set attributes and attribute values: the file item is given all the attributes of the file item set it is to be placed in, with corresponding values. The third combines the first two: the file item is added to one file item set by direct marking and to another by setting attributes. Note that for any one file item set, a file item can be added by only one of the two means, marking or setting attributes. The classification interface passes the file items, their attribute settings and their membership relations to the "file item set storer", which writes this information to the "entity file item set database". The "entity file item set database" is the component that persistently stores the composition and attribute settings of the system's entity file item sets and the attribute values of the file items they contain.
The retrieval flow begins when the user operates the search interface to issue a retrieval request. The "search interface" converts the request into an expression and passes it to the "expression parser". By analyzing the expression, the "expression parser" determines which entity file item sets are needed to compute the result set and the operation relations among them. It submits the required entity file item sets to the "entity file item set extractor", which queries the "entity file item set database", obtains those sets, and passes them to the "set operator". At the same time the "expression parser" passes the operation relations among the entity file item sets to the "set operator". The "set operator" performs the set operations, obtains the result file item set, and passes it to the "search interface", which presents it to the user in a suitable form. If the expression contains a view file item set, the system looks up the definition of that view file item set in the "view file item set definition store" and sends it to the "expression parser" for parsing.
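The retrieval pipeline described above can be sketched as three small functions standing in for the expression parser, the entity file item set extractor, and the set operator. Everything here is an assumption for illustration: the expression syntax (quoted names, the symbols ∩, ∪ and -, parentheses, equal precedence evaluated left to right), the function names, and the dictionaries passed in as database and view definitions.

```python
import re

OPS = {"∩": set.intersection, "∪": set.union, "-": set.difference}
TOKEN = re.compile(r'\s*(?:"([^"]+)"|([∩∪()\-]))')


def tokenize(expression):
    text, pos, tokens = expression.strip(), 0, []
    while pos < len(text):
        match = TOKEN.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected text at position {pos}")
        if match.group(1) is not None:
            tokens.append(("name", match.group(1)))
        else:
            tokens.append(("op", match.group(2)))
        pos = match.end()
    return tokens


def parse(expression):
    """Expression parser: return the operation tree and the set names it needs."""
    tokens, names, index = tokenize(expression), set(), 0

    def operand():
        nonlocal index
        if index >= len(tokens):
            raise ValueError("unexpected end of expression")
        kind, value = tokens[index]
        index += 1
        if kind == "name":
            names.add(value)
            return value
        if value == "(":
            tree = chain()
            if index >= len(tokens) or tokens[index] != ("op", ")"):
                raise ValueError("missing closing parenthesis")
            index += 1
            return tree
        raise ValueError(f"unexpected token {value!r}")

    def chain():
        nonlocal index
        tree = operand()
        while index < len(tokens) and tokens[index][0] == "op" and tokens[index][1] in OPS:
            op = tokens[index][1]
            index += 1
            tree = (op, tree, operand())
        return tree

    tree = chain()
    if index != len(tokens):
        raise ValueError("unexpected trailing tokens")
    return tree, names


def extract(names, database):
    """Entity set extractor: fetch only the sets the expression requires."""
    return {name: set(database[name]) for name in names}


def compute(tree, sets):
    """Set operator: apply the operation relations to the extracted sets."""
    if isinstance(tree, str):
        return sets[tree]
    op, left, right = tree
    return OPS[op](compute(left, sets), compute(right, sets))


def retrieve(expression, database, views=None):
    """Search interface back end: parse, resolve views, extract, and compute."""
    views = views or {}
    tree, names = parse(expression)
    sets = extract({n for n in names if n not in views}, database)
    for name in names:
        if name in views:
            # A view name is resolved by parsing its stored definition.
            sets[name] = retrieve(views[name], database, views)
    return compute(tree, sets)
```

A call such as `retrieve('"youth_university" ∩ "Fujian Province"', database, views)` would first resolve the view from its stored expression and then intersect the result with the extracted entity set.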
Claims (21)
1. A set-based method for classifying and retrieving file items, implemented on a computer system, the method comprising: a classification process, characterized in that membership relations between file items and existing file item sets are established by direct designation, by assigning attributes and setting their corresponding attribute values, or by a combination of both;
a retrieval process, characterized in that a result file item set is obtained from the existing file item sets by specifying a set operation pattern, by specifying attribute value ranges, or by a combination of both.
2. The method according to claim 1, further characterized in that: a file item is the unique identifier of a file within the file system in which it resides.
3. The method according to claim 2, further characterized in that: file items comprise at least two classes, one being the file path of a file in a file system managed by a computer operating system, the other being the URL of a website page accessed with a WWW browser.
4. The method according to claim 1, characterized in that: a file item set is a set composed of n distinct file items, n being an integer greater than or equal to 0; if file item a is a component element of file item set A, then file item a belongs to file item set A; conversely, if file item a belongs to file item set A, then file item a must be a component element of file item set A.
5. The method according to claim 4, further characterized in that: a file item can occur at most once in a file item set, but a file item can belong to several different file item sets.
6. The method according to claim 1, characterized in that: a file item can be assigned n attributes, and a corresponding attribute value must be set for each assigned attribute, n being an integer greater than or equal to 0.
7. The method according to claim 1, characterized in that: a file item set can be assigned n attributes, and a corresponding attribute value range must be set for each assigned attribute, n being an integer greater than or equal to 0.
8. The method according to claim 7, characterized in that: if a file item set has an attribute and a corresponding attribute value range, then every file item belonging to that file item set must have that attribute, with its attribute value lying within the range set by the file item set; a file item may, however, have attributes that the file item set it belongs to does not have.
9. The method according to claim 1, further characterized in that: in a system employing the method, the existing file item sets are entity file item sets; for each entity file item set the system records the file items that belong to it directly, together with the attributes and corresponding attribute values of each such file item; the system also records the attributes and corresponding attribute value ranges directly assigned to the entity file item set, and the direct parent-child relations between that entity file item set and other entity file item sets in the system.
10. The method according to claim 9, further characterized in that: in a system employing the method, if one entity file item set is a subset of another entity file item set, then all file items belonging to the former also belong to the latter, and the former inherits all attributes of the latter, with the corresponding attribute value ranges lying within the former's corresponding attribute value ranges.
11. The method according to claim 10, further characterized in that: in a system employing the method, the parent-child relation is transitive: if entity file item set A is a subset of entity file item set B, and entity file item set B is a subset of entity file item set C, then entity file item set A is also a subset of entity file item set C; the relation between A and B is called a direct parent-child relation, and the relation between A and C is called an indirect parent-child relation.
12. The method according to claim 11, further characterized in that: in a system employing the method, the file items belonging to an entity file item set fall into two classes: file items that belong to the entity file item set directly, and all file items that belong to any of its subsets, direct or indirect.
13. The method according to claim 1, further characterized in that: in a system employing the method, the first mode available to a file item classification process is to designate the membership relations between the file item and n entity file item sets in the system, n being an integer greater than 0.
14. The method according to claim 1, further characterized in that: in a system employing the method, the second mode available to a file item classification process is first to assign the file item n attributes and set their corresponding attribute values, n being an integer greater than 0, and to designate for each attribute an entity file item set having that attribute; the file item is then assigned, according to its value for that attribute, to the m entity file item sets whose value range for that attribute is closest to the file item's value, chosen from among the designated entity file item set and its direct and indirect subsets having that attribute, m being an integer greater than or equal to 0, with no parent-child relation existing among these m entity file item sets.
15. The method according to claim 1, further characterized in that: in a system employing the method, the third mode available to a file item classification process combines directly designating the membership relations between the file item and m entity file item sets in the system with setting n attribute values of the file item, m and n being integers greater than 0; membership in any one entity file item set, however, can be established by only one of the two modes, direct designation or attribute setting.
16. The method according to claim 1, further characterized in that: in a system employing the method, the first mode of retrieval is to specify the set-operation combination pattern of the result file item set, whereupon the system obtains the result file item set by combining existing file item sets; the basic set operations comprise the identity operation on a set, intersection, union, and difference, and any set operation composed from these basic operations falls within the scope of this method.
17. The method according to claim 1, further characterized in that: in a system employing the method, the second mode of retrieval is to specify, for an entity file item set, the value ranges of n of its attributes, n being an integer greater than 0 and less than or equal to the number of attributes that the entity file item set has.
18. The method according to claim 1, further characterized in that: in a system employing the method, the third mode of retrieval is a combination of specifying attribute values and specifying a set operation pattern: a file item set is first obtained by specifying n attributes of an entity file item set and their corresponding value ranges, n being an integer greater than 0; the result file item set is then determined by specifying the set operation relation between this file item set and other file item sets.
19. The method according to claim 1, further characterized in that: in a system employing the method, the most basic way of expressing the specified result file item set in the retrieval process is an operation expression.
20. The method according to claim 1, further characterized in that: in a system employing the method, a view file item set is a combination pattern of other file item sets in the system, and the system records a view file item set as an operation expression.
21. The method according to claims 9 and 20, further characterized in that: in a system employing the method, each file item set has a unique name corresponding to it.
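The transitive parent-child relation of claims 11 and 12 can be illustrated with a short sketch. It is an assumption-laden example rather than the claimed implementation: the set names "China" and "Fuzhou", the file paths, and the two dictionaries are hypothetical.

```python
# Hypothetical records of direct parent-child relations and direct members.
DIRECT_SUBSETS = {"China": ["Fujian Province"], "Fujian Province": ["Fuzhou"], "Fuzhou": []}
DIRECT_ITEMS = {
    "China": {"/hr/a.doc"},
    "Fujian Province": {"/hr/b.doc"},
    "Fuzhou": {"/hr/c.doc"},
}


def all_subsets(name):
    """Return the direct and indirect subsets of an entity file item set."""
    found = []
    stack = list(DIRECT_SUBSETS.get(name, []))
    while stack:
        child = stack.pop()
        if child not in found:
            found.append(child)
            # Subsets of a subset are indirect subsets of the original set.
            stack.extend(DIRECT_SUBSETS.get(child, []))
    return found


def members(name):
    """File items of a set: its direct members plus the members of every subset."""
    items = set(DIRECT_ITEMS.get(name, set()))
    for child in all_subsets(name):
        items |= DIRECT_ITEMS.get(child, set())
    return items
```

Only direct relations are stored; indirect membership is derived on demand, which matches the way claim 9 limits the recorded data to direct members and direct parent-child relations.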
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007100092357A CN101158949A (en) | 2007-07-20 | 2007-07-20 | Method and system for file items ranking and searching based on aggregation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007100092357A CN101158949A (en) | 2007-07-20 | 2007-07-20 | Method and system for file items ranking and searching based on aggregation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101158949A (en) | 2008-04-09 |
Family
ID=39307051
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2007100092357A Pending CN101158949A (en) | 2007-07-20 | 2007-07-20 | Method and system for file items ranking and searching based on aggregation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101158949A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011000144A1 (en) * | 2009-06-30 | 2011-01-06 | Shi Wen | Aggregation method of information items set directory and system thereof |
CN109299055A (en) * | 2018-08-24 | 2019-02-01 | 天津美腾科技有限公司 | File management method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |