WO2001067280A2 - Method for a knowledge model - Google Patents
Method for a knowledge model Download PDFInfo
- Publication number
- WO2001067280A2 WO2001067280A2 PCT/US2001/006457 US0106457W WO0167280A2 WO 2001067280 A2 WO2001067280 A2 WO 2001067280A2 US 0106457 W US0106457 W US 0106457W WO 0167280 A2 WO0167280 A2 WO 0167280A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- item
- network
- recited
- utilizing
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 72
- 230000008520 organization Effects 0.000 claims description 20
- 229940079593 drug Drugs 0.000 claims description 17
- 239000003814 drug Substances 0.000 claims description 17
- 150000001875 compounds Chemical class 0.000 claims description 15
- 238000004590 computer program Methods 0.000 claims description 12
- 230000001225 therapeutic effect Effects 0.000 claims description 12
- 201000010099 disease Diseases 0.000 claims description 10
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 10
- 230000008859 change Effects 0.000 claims description 6
- 239000000284 extract Substances 0.000 claims description 6
- 238000012544 monitoring process Methods 0.000 claims description 5
- 238000013507 mapping Methods 0.000 claims description 3
- 230000006870 function Effects 0.000 description 89
- 230000008569 process Effects 0.000 description 33
- 238000012545 processing Methods 0.000 description 22
- 238000011161 development Methods 0.000 description 12
- 230000018109 developmental process Effects 0.000 description 12
- 238000011160 research Methods 0.000 description 12
- 238000013461 design Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 10
- 239000000919 ceramic Substances 0.000 description 9
- 230000007246 mechanism Effects 0.000 description 9
- 239000002585 base Substances 0.000 description 8
- 230000006399 behavior Effects 0.000 description 8
- 238000003490 calendering Methods 0.000 description 7
- 229960000830 captopril Drugs 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- -1 captopril compound Chemical class 0.000 description 5
- 230000002526 effect on cardiovascular system Effects 0.000 description 5
- 238000005538 encapsulation Methods 0.000 description 5
- 230000010354 integration Effects 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- FAKRSMQSSFJEIM-RQJHMYQMSA-N captopril Chemical compound SC[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O FAKRSMQSSFJEIM-RQJHMYQMSA-N 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 208000019553 vascular disease Diseases 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 230000000712 assembly Effects 0.000 description 2
- 238000000429 assembly Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 101000666896 Homo sapiens V-type immunoglobulin domain-containing suppressor of T-cell activation Proteins 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- 102100038282 V-type immunoglobulin domain-containing suppressor of T-cell activation Human genes 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 239000003637 basic solution Substances 0.000 description 1
- 238000011953 bioanalysis Methods 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 238000001647 drug administration Methods 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 238000012912 drug discovery process Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- IJJVMEJXYNJXOJ-UHFFFAOYSA-N fluquinconazole Chemical compound C=1C=C(Cl)C=C(Cl)C=1N1C(=O)C2=CC(F)=CC=C2N=C1N1C=NC=N1 IJJVMEJXYNJXOJ-UHFFFAOYSA-N 0.000 description 1
- 230000008570 general process Effects 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 238000005461 lubrication Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 231100000279 safety data Toxicity 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012956 testing procedure Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 239000006163 transport media Substances 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H70/00—ICT specially adapted for the handling or processing of medical references
- G16H70/40—ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H70/00—ICT specially adapted for the handling or processing of medical references
- G16H70/60—ICT specially adapted for the handling or processing of medical references relating to pathologies
Definitions
- the present invention relates to information management and retrieval and more particularly to models for obtaining, managing, and providing information from a plurality of sources.
- An “information space” is the set of all sources of information that is available to a user at a given time or setting.
- a user is forced to spend too much "overhead” on discovering and remembering where different information is located (e.g., online technical books, manual pages ("manpages"), release notes, help information, etc.).
- the user also spends a large amount of time remembering how to find information in each delivery mechanism.
- it is difficult for the user to remember where potentially relevant information might be, and the user is forced to jump between multiple different online tools to find it.
- a method for a knowledge model is provided.
- a network is utilized to access or one or more sources.
- Information from the source is then extracted utilizing the network.
- the format of the extracted information is converted into a common format.
- An index is generated for the information utilizing a knowledge model.
- the generated index is then stored in a database.
- the knowledge model may include a plurality of inter- associated items.
- generation of the index may include associating the extracted information with one or more of the items of the model, and then mapping the extracted information to the associated item.
- the items of the knowledge model may include a therapeutic area item, a target item, disease item, a scientist item, an organization item, a patent item, a compound item, a literature item, a FDA approval item, and/or a drug item.
- the knowledge model may also provide an organizational structure to the generated index.
- the extracted information may include pharmaceutical information.
- the sources may include one or more an internal sources, and/or an external sources.
- the network may be capable of communicating using TCP/IP protocol.
- a user may be permitted to access the database utilizing the network to retrieve the stored index.
- a query may be received utilizing the network whereupon, the index may be searched for information matching the query to thereby permit retrieval of the matching information utilizing the network.
- Figure 1 is a schematic diagram of a hardware implementation of one embodiment of the present invention.
- Figure 2 is a schematic diagram of a knowledgebase framework in accordance with an embodiment of the present invention.
- Figure 3 is a schematic diagram of the knowledgebase framework illustrating how the index may be used as a user's primary gateway to a wide variety of information sources in accordance with an embodiment of the present invention
- Figure 4 is a schematic diagram of an illustrative knowledge model in accordance with an embodiment of the present invention.
- Figure 5 illustrates a flowchart for a process for a knowledgebase framework in accordance with an embodiment of the present invention
- Figure 6 illustrates a flowchart for a process for a knowledge model in accordance with an embodiment of the present invention
- Figure 7 is a schematic illustration of a log in frame for permitting a user to log into the knowledgebase framework utilizing a network in accordance with an embodiment of the present invention
- Figure 8 is a schematic illustration of a search and browse frame of the decision support application for permitting a user browse the knowledge model-based index in accordance with an embodiment of the present invention
- Figure 9 is a schematic illustration of the items of the knowledge model-base index displayed in the search and browse frame upon selection of the cardiology/vascular diseases item in Figure 8 in accordance with an embodiment of the present invention
- Figure 10 is a schematic illustration of the items of the knowledge model-base index displayed in the search and browse frame upon selection of the ACE biological target item in Figure 9 in accordance with an embodiment of the present invention
- Figure 11 is a schematic illustration of illustrative items of the knowledge model-base index displayed in the search and browse frame upon selection of the captopril compound item in accordance with an embodiment of the present invention
- Figure 12 is a schematic illustration of a browser frame which displays information from knowledge source relating to an item upon selection of the visit source selection in accordance with an embodiment of the present invention
- Figure 13 is a schematic illustration of an exemplary display of the display bar upon selection of a scientist/person item in accordance with an embodiment of the present invention
- Figure 14 is a schematic illustration of illustrative items of the knowledge model-base index displayed in the search and browse frame upon selection of a patent item in accordance with an embodiment of the present invention
- Figure 15 is a schematic illustration of a browser frame which displays information from knowledge source relating to the selected patent item of Figure 14 upon selection of the visit source selection in accordance with an embodiment of the present invention
- Figure 16 illustrates a recent news frame that the user may access after logging on to the network in accordance with an embodiment of the present invention
- Figure 17 illustrates a flowchart for a process for monitoring information in a knowledgebase framework in accordance with an embodiment of the present invention
- Figure 18A is a schematic illustration of a process map displayed in a research frame for use in a knowledgebase framework in accordance with an embodiment of the present invention
- Figure 18B is a schematic illustration of an illustrative template for a process map displayed in the research frame in accordance with an embodiment of the present invention
- Figure 18C illustrates an inbox for managing email messages displayed in an organizer frame for use in a knowledgebase framework in accordance with an embodiment of the present invention
- Figure 18D illustrates a calendar for managing events and appointments of a user displayed in an organizer frame for use in a knowledgebase framework in accordance with an embodiment of the present invention
- Figure 19 is a schematic diagram of an exemplary system architecture in accordance with an embodiment of the present invention.
- FIG. 20 is a flowchart of the system in accordance with an embodiment of the present invention.
- Figure 21 is a flowchart of a parsing unit of the system in accordance with an embodiment of the present invention.
- Figure 22 is a flowchart for pattern matching in accordance with an embodiment of the present invention.
- Figure 23 is a flowchart for a search unit in accordance with an embodiment of the present invention.
- FIG. 24 is a flowchart for overall system processing in accordance with an embodiment of the present invention.
- Figure 25 is a flowchart of topic processing in accordance with an embodiment of the present invention
- Figure 26 is a flowchart of meeting record processing in accordance with an embodiment of the present invention
- Embodiments of the present invention show how the concept of knowledge integration can be applied in the business world, especially in the pharmaceutical industry. Aspects of the present invention may be targeted for users active in the drug discovery process such as scientist and other researchers. Embodiments of the present invention may use knowledge integration technology to semantically integrate the knowledge capital located in various isolated repositories in the Internet. The information from this repositories are extracted and are classified based on various facets such as, for example drug, chemical compound, biological target, scientist, etc. As the results, embodiments of the present invention can graphically show users how the various facets of the information are related to each other.
- FIG. 1 An embodiment of a system in accordance with the present invention is preferably practiced in the context of a personal computer such as an IBM compatible personal computer, Apple Macintosh computer or UNIX based workstation.
- a representative hardware environment is depicted in Figure 1, which illustrates a typical hardware configuration of a workstation in accordance with a preferred embodiment having a central processing unit 110, such as a microprocessor, and a number of other units interconnected via a system bus 112.
- a central processing unit 110 such as a microprocessor
- the workstation shown in Figure 1 includes a Random Access Memory (RAM) 114, Read Only Memory (ROM) 116, an I/O adapter 118 for connecting peripheral devices such as disk storage units 120 to the bus 112, a user interface adapter 122 for connecting a keyboard 124, a mouse 126, a speaker 128, a microphone 132, and/or other user interface devices such as a touch screen (not shown) to the bus 112, communication adapter 134 for connecting the workstation to a communication network (e.g., a data processing network) and a display adapter 136 for connecting the bus 112 to a display device 138.
- a communication network e.g., a data processing network
- display adapter 136 for connecting the bus 112 to a display device 138.
- the workstation typically has resident thereon an operating system such as the Microsoft Windows NT or Windows/95 Operating System (OS), the IBM OS/2 operating system, the MAC OS, or UNIX operating system.
- OS Microsoft Windows NT or Windows/95 Operating System
- IBM OS/2 operating system the IBM OS/2 operating system
- MAC OS the MAC OS
- UNIX operating system a preferred embodiment is written using JAVA, C, and the C++ language and utilizes object oriented programming methodology.
- Object oriented programming (OOP) has become increasingly used to develop complex applications. As OOP moves toward the mainstream of software design and development, various software solutions require adaptation to make use of the benefits of OOP. A need exists for these principles of OOP to be applied to a messaging interface of an electronic messaging system such that a set of OOP classes and objects for the messaging interface can be provided.
- OOP is a process of developing computer software using objects, including the steps of analyzing the problem, designing the system, and constructing the program.
- An object is a software package that contains both data and a collection of related structures and procedures.
- OOP Since it contains both data and a collection of structures and procedures, it can be visualized as a self-sufficient component that does not require other additional structures, procedures or data to perform its specific task. OOP, therefore, views a computer program as a collection of largely autonomous components, called objects, each of which is responsible for a specific task. This concept of packaging data, structures, and procedures together in one component or module is called encapsulation.
- OOP components are reusable software modules which present an interface that conforms to an object model and which are accessed at run-time through a component integration architecture.
- a component integration architecture is a set of architecture mechanisms which allow software modules in different process spaces to utilize each others capabilities or functions. This is generally done by assuming a common component object model on which to build the architecture. It is worthwhile to differentiate between an object and a class of objects at this point.
- An object is a single instance of the class of objects, which is often just called a class.
- a class of objects can be viewed as a blueprint, from which many objects can be formed.
- OOP allows the programmer to create an object that is a part of another object.
- the object representing a piston engine is said to have a composition-relationship with the object representing a piston.
- a piston engine comprises a piston, valves and many other components; the fact that a piston is an element of a piston engine can be logically and semantically represented in OOP by two objects.
- OOP also allows creation of an object that "depends from" another object. If there are two objects, one representing a piston engine and the other representing a piston engine wherein the piston is made of ceramic, then the relationship between the two objects is not that of composition. A ceramic piston engine does not make up a piston engine.
- the object representing the ceramic piston engine is called a derived object, and it inherits all of the aspects of the object representing the piston engine and adds further limitation or detail to it.
- the object representing the ceramic piston engine "depends from” the object representing the piston engine. The relationship between these objects is called inheritance.
- the object or class representing the ceramic piston engine inherits all of the aspects of the objects representing the piston engine, it inherits the thermal characteristics of a standard piston defined in the piston engine class.
- the ceramic piston engine object overrides these ceramic specific thermal characteristics, which are typically different from those associated with a metal piston. It skips over the original and uses new functions related to ceramic pistons.
- Different kinds of piston engines have different characteristics, but may have the same underlying functions associated with it (e.g., how many pistons in the engine, ignition sequences, lubrication, etc.).
- a programmer would call the same functions with the same names, but each type of piston engine may have different/overriding implementations of functions behind the same name. This ability to hide different implementations of a function behind the same name is called polymorphism and it greatly simplifies communication among objects.
- composition-relationship With the concepts of composition-relationship, encapsulation, inheritance and polymorphism, an object can represent just about anything in the real world. In fact, one's logical perception of the reality is the only limit on determining the kinds of things that can become objects in object- oriented software. Some typical categories are as follows:
- Objects can represent physical objects, such as automobiles in a traffic-flow simulation, electrical components in a circuit-design program, countries in an economics model, or aircraft in an air-traffic-control system.
- Objects can represent elements of the computer-user environment such as windows, menus or graphics objects.
- An object can represent an inventory, such as a personnel file or a table of the latitudes and longitudes of cities.
- An object can represent user-defined data types such as time, angles, and complex numbers, or points on the plane.
- OOP allows the software developer to design and implement a computer program that is a model of some aspects of reality, whether that reality is a physical entity, a process, a system, or a composition of matter. Since the object can represent anything, the software developer can create an object which can be used as a component in a larger software project in the future.
- OOP enables software developers to build objects out of other, previously built objects.
- C++ is an OOP language that offers a fast, machine-executable code.
- C++ is suitable for both commercial-application and systems-programming projects.
- C++ appears to be the most popular choice among many OOP programmers, but there is a host of other OOP languages, such as Smalltalk, Common Lisp Object System (CLOS), and Eiffel. Additionally, OOP capabilities are being added to more traditional popular computer programming languages such as Pascal.
- object classes can be summarized, as follows:
- Encapsulation enforces data abstraction through the organization of data into small, independent objects that can communicate with each other. Encapsulation protects the data in an object from accidental damage, but allows other objects to interact with that data by calling the object's member functions and structures.
- Class hierarchies and containment hierarchies provide a flexible mechanism for modeling real-world objects and the relationships among them.
- Class libraries are very flexible. As programs grow more complex, more programmers are forced to adopt basic solutions to basic problems over and over again.
- a relatively new extension of the class library concept is to have a framework of class libraries. This framework is more complex and consists of significant collections of collaborating classes that capture both the small scale patterns and major mechanisms that implement the common requirements and design in a specific application domain. They were first developed to free application programmers from the chores involved in displaying menus, windows, dialog boxes, and other standard user interface elements for personal computers.
- Frameworks also represent a change in the way programmers think about the interaction between the code they write and code written by others.
- the programmer called libraries provided by the operating system to perform certain tasks, but basically the program executed down the page from start to finish, and the programmer was solely responsible for the flow of control. This was appropriate for printing out paychecks, calculating a mathematical table, or solving other problems with a program that executed in just one way.
- a programmer writing a framework program not only relinquishes control to the user (as is also true for event loop programs), but also relinquishes the detailed flow of control within the program to the framework. This approach allows the creation of more complex systems that work together in interesting ways, as opposed to isolated programs, having custom code, being created over and over again for similar problems.
- a framework basically is a collection of cooperating classes that make up a reusable design solution for a given problem domain. It typically includes objects that provide default behavior (e.g., for menus and windows), and programmers use it by inheriting some of that default behavior and overriding other behavior so that the framework calls application code at the appropriate times.
- default behavior e.g., for menus and windows
- Behavior versus protocol Class libraries are essentially collections of behaviors that you can call when you want those individual behaviors in your program.
- a framework provides not only behavior but also the protocol or set of rules that govern the ways in which behaviors can be combined, including rules for what a programmer is supposed to provide versus what the framework provides.
- a preferred embodiment of the invention utilizes HyperText Markup Language (HTML) to implement documents on the Internet together with a general-pu ⁇ ose secure communication protocol for a transport medium between the client and the Newco. HTTP or other protocols could be readily substituted for HTML without undue experimentation. Information on these products is available in T. Berners-Lee, D. Connoly, "RFC 1866: Hypertext
- HTML Markup Language - 2.0
- R. Fielding H, Frystyk, T. Berners-Lee, J. Gettys and J.C. Mogul, "Hypertext Transfer Protocol - HTTP/1.1 : HTTP Working Group Internet Draft” (May 2, 1996).
- HTML is a simple data format used to create hypertext documents that are portable from one platform to another.
- HTML documents are SGML documents with generic semantics that are appropriate for representing information from a wide range of domains.
- HTML has been in use by the World-Wide Web global information initiative since 1990. HTML is an application of ISO Standard 8879; 1986 Information Processing Text and Office Systems; Standard Generalized Markup Language (SGML).
- SGML Standard Generalized Markup Language
- HTML has been the dominant technology used in development of Web-based solutions.
- HTML has proven to be inadequate in the following areas:
- UI User Interface
- Custom “widgets” e.g., real-time stock tickers, animated icons, etc.
- client-side performance is improved.
- Java supports the notion of client-side validation, offloading appropriate processing onto the client for improved performance.
- Dynamic, real-time Web pages can be created. Using the above-mentioned custom UI components, dynamic Web pages can also be created.
- Sun's Java language has emerged as an industry-recognized language for "programming the Internet.”
- Sun defines Java as: "a simple, object-oriented, distributed, inte ⁇ reted, robust, secure, architecture-neutral, portable, high-performance, multithreaded, dynamic, buzzword- compliant, general-pu ⁇ ose programming language.
- Java supports programming for the Internet in the form of platform-independent Java applets.”
- Java applets are small, specialized applications that comply with Sun's Java Application Programming Interface (API) allowing developers to add "interactive content” to Web documents (e.g., simple animations, page adornments, basic games, etc.). Applets execute within a Java-compatible browser (e.g.,
- ActiveX Technologies to give developers and Web designers wherewithal to build dynamic content for the Internet and personal computers.
- ActiveX includes tools for developing animation, 3-D virtual reality, video and other multimedia content.
- the tools use Internet standards, work on multiple platforms, and are being supported by over 100 companies.
- the group's building blocks are called ActiveX Controls, small, fast components that enable developers to embed parts of software in hypertext markup language (HTML) pages.
- ActiveX Controls work with a variety of programming languages including Microsoft Visual C++, Borland Delphi, Microsoft Visual Basic programming system and, in the future, Microsoft's development tool for Java, code named "Jakarta.”
- ActiveX Technologies also includes ActiveX Server Framework, allowing developers to create server applications.
- ActiveX could be substituted for JAVA without undue experimentation to practice the invention.
- the right information may include information from all parts of the organization and from external sources, information in the context of the business process (regardless of the source or format), and relevant information about business entities and relationships (rather than keywords and documents).
- Delivering the right information to the right person may involve filtering of the information based on needs of the individual, and delivery of the filtered information to the individual or team.
- the right time may mean providing up-to-date information and information on demand.
- the right information can be delivered to the right person at the right time.
- the information can come from internal and external sources.
- the information can also be cleansed, integrated and placed in the right business context and also be customized to meet an individual's particular needs.
- Embodiments of the present invention also allow information to be delivered proactively (i.e., "pushed").
- FIG. 2 is a schematic diagram of a knowledgebase framework 200 in accordance with an embodiment of the present invention.
- a knowledgebase framework With such a knowledgebase framework, a wide variety of information sources may be extracted, cleansed and structured based on a knowledge model to create a highly customized index that can be accessed and browsed by a user.
- information may be contained in a plurality of internal sources 202 and external sources 204.
- An internal source 202 of information is typically an information source that is an under the control of entity that employs the user and whose information may be proprietary to the entity.
- Internal sources of information may include, for example: discovery information, PD information, clinical information, regulatory information, and M&S information.
- An external source 204 of information is typically an information source that is not under the control of entity that employs the user.
- An external source may typically be accessible utilizing a wide are network such as the Internet and World Wide Web.
- External sources may include for example: bio-analysis information, study management information, safety data information, market report information, and Internet websites including government, public, and subscription based websites.
- the knowledgebase framework may also include an index creator 206 which is connected to the internal and external sources 202, 204 by a network.
- the index creator 206 may also include or have access to a knowledgebase model 208. Utilizing the knowledgebase model 208, the index creator 206 may extract a wide variety information from the internal and external sources 202, 204, cleanse the extracted information, restructure the extracted information and then reconcile the extracted information into a knowledge model -based index.
- the knowledgebase framework 200 may also include an index database 210 coupled to the index creator 206 for storing the knowledge model-based index created by the index creator 206. Users may then access the knowledge model-based index stored in the database 210 from a browser/portal 212 utilizing the network.
- the knowledgebase framework may also include a web server 214 or other similar type of computer for interfacing the browser/portal 212 with the database 210.
- the knowledgebase framework may include a decision support application 216 for helping a user determine what is the right information for the user and help the user receive the right information at the right time for the user.
- the decision support application 216 (in combination with the browser/portal 212) provides the user with the capability to browse and navigate through an integrated web of knowledge regardless the location of the knowledge sources.
- the decision support application 216 also allows the user to access internal and external information.
- the decision support application 216 may also be used to provide a user with information tailored for a specific process such as, for example, a drug discovery.
- the decision support application 216 may further be used to help deliver the right information to the user by allowing them to monitor internal and external events at a wide range of granularity.
- Figure 3 is a schematic diagram of the knowledgebase framework illustrating how the index may be used as a user's primary gateway to a wide variety of information sources 202, 204 in accordance with an embodiment of the present invention. From this index and with the assistance of the decision support application 216, a user has the ability to retrieve the original information source.
- Some illustrative information sources may include a genomics database 300, a pre-clinical database 302, a clinical database 304, a departmental reports database 306, a subscription based information database 308, and/or market report database 310. Some of these information sources may comprises exclusively internal or external sources while other information sources may comprise a combination of internal and external sources.
- the departmental reports database 306 may exclusively comprise internal information sources, while the subscription based information database 308, and market report database 310 may comprise exclusively external information sources.
- the genomics database 300, the pre-clinical database 302, and the clinical database 304 may all comprise a combination of internal and external information sources.
- Figure 4 is a schematic diagram of an illustrative knowledge model 208 in accordance with an embodiment of the present invention.
- the knowledge model comprises a plurality of interrelated items.
- the items of the knowledge model may include: a therapeutic area item 400, a target item 402, disease item 404, a scientist item 406, an organization item 408, a patent item 410, a compound item 412, a literature item 414, a FDA approval item 416, and a drug item 418.
- connecting line 420 The inter-relations between the various items of the knowledge model are illustrated in Figure 4 by the plurality of connecting lines (e.g., connecting line 420)connecting each of the items to various other items.
- the literature item 414 is directly related with the target item 402, the scientist item 406, the organization item 408, the compound item 412, and drug item
- the knowledge model also helps to provide an organizational structure to the index generated in the knowledgebase framework so that the items of the generated index are arranged according to the organization structure.
- the organizational structure of the generated index may be based on the inter-relations between the items of the knowledge model.
- Figure 5 illustrates a flowchart for a process 500 for a knowledgebase framework in accordance with an embodiment of the present invention.
- Information is obtained from at least one knowledge source utilizing a network in operation 502.
- an index is generated for the obtained information in operation 504.
- This generated index includes a plurality of items (or entries) each associated with at least some of the obtained information.
- the generated index may then be stored in an index database.
- the generated index is displayed to a user in operation 506.
- the user is permitted to select an least one of the items of the index in operation 508.
- the information associated with the selected item is then displayed to the user utilizing the network in operation
- one of the knowledge sources from which information is obtained may be an internal source under the control of entity that employs the user and whose information therein may be proprietary to the entity.
- Some illustrative examples of internal sources include: a genomics database, a pre-clinical database, a clinical database, and/or a departmental reports database.
- one of the knowledge sources from which information is obtained may be an external source (e.g., a website) accessible utilizing a wide are network such as the Internet and World Wide Web.
- the external sources may not typically be under the control of entity that employs the user.
- Some illustrative examples of external sources include subscription based information, and/or market reports.
- the information obtained from the sources may include pharmaceutical information such as, for example, information relating to: a pharmaceutical therapeutic area, a pharmaceutical target, a pharmaceutical compound, a disease, a patent, the Federal Drug Administration (FDA) (such as information regarding FDA approval of a pharmaceutical), a person researching or working on a pharmaceutical, and/or pharmaceutical literature such as a periodical.
- FDA Federal Drug Administration
- the network may be utilized to monitor one or more of the knowledge sources for updated information relating to one or more items in the index.
- a notice may be generated regarding the updated information. This notice may then be transmitted to the user utilizing the network to notify the user of the updated information.
- the user may be allowed to select the knowledge source(s) to be monitored for updates or other changes.
- the user may be permitted to input a search term for searching the index utilizing the network.
- the index may be searched for items associated with the search term. Items of the index associated with the input search term (i.e., that match the search term) may then be displayed to the user utilizing the network.
- the items of the index may be organized and displayed in some sort of a hierarchical format such as, for example, a hierarchical tree format.
- displaying of the information associated with the selected item (or entry) to the user may also include utilizing the network to retrieve the associated information from the knowledgebase source (such as a website) from which the associated information was obtained.
- the network may be capable of communicating using TCP/IP protocol.
- Figure 6 illustrates a flowchart for a process 600 for a knowledge model in accordance with an embodiment of the present invention.
- a network is utilized to access or one or more knowledge sources in operation 602.
- Information from the knowledge source is then extracted utilizing the network in operation 604.
- the extracted information may have its own formatting.
- the format of the extracted information is converted in operation 606 into a common or standardized format (i.e., cleansing and restructuring).
- An index is generated for the standardized extracted information utilizing a knowledge model in operation 608.
- the generated index is then stored in an index database in operation 610.
- the knowledge model may include a plurality of inter- associated or inter-related items.
- generation of the index may include associating the extracted information with one or more of the items of the model, and then mapping the extracted information to the associated item. In this manner, when the index is displayed to a user, selection of the item by a user links the user to the associated information and the source of the information.
- the items of the knowledge model may include a therapeutic area item, a target item, disease item, a scientist item, an organization item, a patent item, a compound item, a literature item, a FDA approval item, and/or a drug item.
- the knowledge model may also provide an organizational structure to the generated index so that the items of the generated index are arranged according to the organization structure.
- the organizational structure may be a hierarchical tree of the items.
- the extracted information may include pharmaceutical information.
- the knowledge sources may include one or more an internal knowledge sources, and/or an external knowledge sources.
- the network may be capable of communicating using TCP/IP protocol.
- a user may be permitted to access the database utilizing the network to retrieve the stored index.
- a query may be received utilizing the network whereupon, the index may be searched for information matching the query to thereby permit retrieval of the matching information utilizing the network.
- the knowledgebase framework 200 may be used to help a user learn about a field and/or catch up on new developments in this field.
- a user may be able to use the knowledgebase framework 200 to find people who are involved in the area being studied and their background, previous research work done in the area (which in an illustrative embodiment may include a list of targets, compounds and drugs), and obtain research reports relating to the area being studied.
- the user may utilize the knowledgebase framework 200 to find information from external sources such as, for example: recent patents, targets, compounds, and drugs relating to the area being studied, as well the people (such as scientists) who are actively working in this field or area of study.
- Figure 7 is a schematic illustration of a log in frame 700 for permitting a user to log into the knowledgebase framework utilizing a network in accordance with an embodiment of the present invention.
- the log in frame 700 may include a user name field 702 for permitting a user to enter a user name associated therewith, a password field 704 for permitting the user to enter a password associated with the user and the user name, and a selectable log in button 706 for permitting the user to log into the knowledgebase framework upon selection thereof.
- FIG. 8 is a schematic illustration of a search and browse frame 800 of the decision support application for permitting a user browse the knowledge model-based index in accordance with an embodiment of the present invention.
- Displayed in the search and browse frame 800 is a portion of the items of an illustrative knowledge model-based index.
- the items of the index being displayed comprises the therapeutic area items 802 of the illustrative index including, for example, a cardiology/vascular diseases 804 therapeutic area item.
- the items of the index displayed in the search and browse frame 800 also comprise selectable links which upon selection thereof by the user leads to the retrieval from the index database 210 and the displaying in the search and browse frame 800 of items of the knowledge model-based index related to the selection item.
- Figure 9 is a schematic illustration of the items of the knowledge model-base index displayed in the search and browse frame 800 upon selection of the cardiology/vascular diseases item 804 in Figure 8 in accordance with an embodiment of the present invention. These items also comprise selectable links so that selection of one of them leads to the retrieving and display of other items of the index related to the selected item. Included in Figure 9, are disease items 900 and biological target items 902 of the illustrative index related to the selected therapeutic target item 804. In an illustrative example, one of the biological target items of the index being displayed in Figure 9 may include an ACE biological target item 904.
- Figure 10 is a schematic illustration of the items of the knowledge model-base index displayed in the search and browse frame 800 upon selection of the ACE biological target item 904 in Figure 9 in accordance with an embodiment of the present invention. These items also comprise selectable links so that selection of one of them leads to the retrieving and display of other items of the index related to the selected item. Included in Figure 10, are component items 1002 of the illustrative index related to the selected item. In an illustrative example, one of the compound items of the index being displayed in Figure 10 may include a captopril compound item 1004.
- the search and browse frame As illustrated in Figure 10, in one aspect of the present invention, the search and browse frame
- the 800 may include a display bar 1006 in which the hierarchy 1008 in the index of the selected item may be displayed.
- the displayed hierarchy 1008 may comprises the target name ACE, and the therapeutic area cardiology/vascular.
- Figure 11 is a schematic illustration of illustrative items of the knowledge model-base index displayed in the search and browse frame 800 upon selection of the captopril compound item 1004 in accordance with an embodiment of the present invention. These items also comprise selectable links so that selection of one of them leads to the retrieving of information from the knowledgebase framework 200 and/or other items of the index related to the selected item. Included in Figure 11, are illustrative items of the illustrative index related to the selected item
- the displayed related items include: related drug items 1100, related patent items 1102, related scientists and/or people items 1104, related literature items 1106, and related disease items 1108.
- one of the items may be selected (such as by clicking the right button of a mouse when the mouse pointer is over the item, i.e., "right clicking") to display a pop-up menu 1110 which includes a monitor selection 1112 and a visit source selection 1114.
- Figure 12 is a schematic illustration of a browser frame 1200 which displays information 1202 from knowledge source relating to an item (in this example the captopril drug item 1004) upon selection of the visit source selection 1114 of Figure 11 in accordance with an embodiment of the present invention.
- the information 1202 displayed in the browser frame 1200 relates to the drug captopril and is obtained from an external source, in this case an illustrative website 1204 on the Internet
- Figure 13 is a schematic illustration of an exemplary display of the display bar 1006 upon selection of a scientist/person item 1104 in accordance with an embodiment of the present invention.
- the knowledgebase framework may retrieve contact information for the scientist/person from a knowledge source and then display the retrieved contact information 1300 in the display bar 1006.
- Figure 14 is a schematic illustration of illustrative items of the knowledge model-base index displayed in the search and browse frame 800 upon selection of the patent item 1116 of Figure 11 in accordance with an embodiment of the present invention.
- the displayed items related to the selected patent item 1116 may include items for patent cited 1400 in the selected patent 1116, items for organizations 1402 related to the selected patent, items for FDA approval matters 1404 related to the selected patent, items for scientists and/or people 1406 associated with the selected patent, and items for compounds 1004 related to the selected patent.
- Figure 15 is a schematic illustration of a browser frame 1500 which displays information 1502 from knowledge source relating to the selected patent item of Figure 14 upon selection of the visit source selection 1114 of Figure 14 in accordance with an embodiment of the present invention.
- the information 1502 displayed in the browser frame 1500 relates to US Patent 5,238,924 and is obtained from an external source, in this case, the USPTO website's Patent Full Text and Image Database.
- a user may be able to monitor work done by others, such as scientists researching a particular area or field. This may be accomplished by selecting the monitor selection 1110 of a selected item, such as for example a scientist item 1118 displayed the search and browse frame of Figure 11.
- any of the items of the index can be monitored for changes and updates to related information.
- a user can select for monitoring an organization for any new publications, patents, drug applications, and recent news associated with the organization.
- a target item may be monitored for any new information related to monitored target.
- Figure 16 illustrates a recent news frame 1600 that the user may access after logging on to the network in accordance with an embodiment of the present invention.
- a portion of the recent news frame 1600 may comprise a recent news portion 1602 and a monitored items portion 1604.
- links 1606, 1608 to recently updated information may be displayed.
- selection of link 1606 (which is a link to a patent) may result in the retrieval and displaying of a browser frame similar to the browser frame of Figure 15 to display information relating to the selected patent obtained from the USPTO website's Patent Full Text and Image Database.
- the links may also comprise hypertext links and include textual summaries relating to information that was recently updated (see e.g., link 1608).
- these links may be organized by subjects or items such as for example, the items of the knowledge model 208 and displayed in corresponding subheadings.
- some illustrative subheadings may include an organizations subheading 1610, a targets subheading 1612, and a scientist subheading 1614.
- the monitored items portion 1604 may display a list of items 1616 selected by the user to be monitored by the knowledgebase framework. Like the recent news links 1606, the items 1616 in the monitored items portion 1604 may comprise links to access items in the knowledge model- based index.
- a search bar 1618 may also be displayed to the user.
- the search bar 1618 may include at least one field 1620 for the user to input a search term.
- a search may be conducted by a search engine for items in the index related to the inputted term(s).
- Figure 17 illustrates a flowchart for a process 1700 for monitoring information in a knowledgebase framework in accordance with an embodiment of the present invention.
- information is received from a user that relates to a target to be tracked in operation 1702.
- the network is monitored for changes or updates to the target 1704.
- data relating to the monitored changes is retrieved in operation 1706 and then transmitted to the user utilizing the network in operation 1708.
- the target may be an item of index displayed to the user utilizing the network.
- the target may be: a publication
- a drug e.g., literature
- a person e.g., scientist
- a therapeutic area e.g., a disease, a biological target, an organization, a compound, a patent, FDA approval, and/or a drug.
- a pharmaceutical database may be monitored for changes or updates relating to the target.
- the network may comprise an intranet of an organization and the Internet.
- the received information may be stored in memory.
- the retrieved data may be transmitted to the user after receipt of an indication that the user has logged on to the network.
- the retrieved data may be automatically transmitted to the user after receipt of the indication that the user has logged on to the network.
- the user may be alerted that a change or update to the target has been monitored utilizing the network.
- the user may be permitted to input a search term utilizing the network.
- items associated with the search term may be searched for upon receipt of the search term. Then those items which have been found to be associated with the inputted search term may be displayed to the user utilizing the network.
- FIG 18A is a schematic illustration of a process map 1800 displayed in a research frame 1802 for use in a knowledgebase framework in accordance with an embodiment of the present invention.
- a user may be able to share a set of common processes and report templates with one or more other users to document their findings for further collaboration.
- the process map graphically depicts a timeline for one or more processes.
- the depicted process map 1800 may be for process for the development of a product 1804 in a pharmaceutical setting.
- the research frame 1802 may also include selectable links for accessing various tools for the research frame such as for example, templates 1806 and target tracking tools 1808.
- Figure 18B is a schematic illustration of an illustrative template 1810 for a process map displayed in the research frame 1802 in accordance with an embodiment of the present invention.
- Figure 18C illustrates an inbox 1812 for managing email messages displayed in an organizer frame 1814 for use in a knowledgebase framework in accordance with an embodiment of the present invention.
- the organizer frame may also include a plurality of tool links for accessing features of the organizer frame.
- Illustrative examples of tool links include a mail link 1816 for displaying the inbox upon selection thereof and a calendar link 1818.
- a pull-down menu 1820 that may be included in the search bar 1618 to further narrow a search of terms input into the search bar.
- Figure 18D illustrates a calendar 1822 for managing events and appointments of a user displayed in an organizer frame 1814 for use in a knowledgebase framework in accordance with an embodiment of the present invention.
- the calendar 1822 may be displayed upon selection of the calendar link 1818 by a user.
- Acme Pharmaceutical has just installed a knowledgebase framework.
- the knowledgebase framework allows Acme Pharmaceutical's scientists to search, browse and monitor internal and external information available to them. Anne accesses the knowledgebase framework from her computer desktop. Anne accesses the knowledgebase framework from her computer desktop. She spends almost the entire day using the knowledgebase framework and at the end of the day she is able to find:
- Anne finds two scientists whose work seem to be relevant to her first assignment. She sets up her profile in the knowledgebase framework in such a way that it will monitor any future work done by these scientists ...
- Anne's first assignment is to investigate TR27 K-Channel as a potential target for hypertension treatment. She uses the knowledgebase framework to find out any previous work related TR27. She finds only one article that are somewhat relevant. Since she will be working on this target for awhile, she sets up the knowledgebase framework to monitor any new information related to TR27. One morning a couple days later, Anne turns on her computer and the knowledgebase framework informs her that Pfizer has filed a patent and this patent has cited TR27. Anne quickly browses through the patent. Fortunately, the patent cited TR27 for a different reason.
- Target DB a tool that stores information of all targets investigated by Acme Pharmaceutical, from the knowledgebase framework to find out the details information about assay used for TR27.
- Figure 19 is a schematic diagram of an exemplary system architecture in accordance with an embodiment of the present invention.
- a BackgroundFinder is implemented as an agent responsible for preparing an individual for an upcoming meeting by helping him her retrieve relevant information about the meeting from various sources.
- BF receives input text in character form indicative of the target meeting.
- the input text is generated in accordance with an embodiment of the present invention by a calendar program that includes the time of the meeting. As the time of the meeting approaches, the calendar program is queried to obtain the text of the target event and that information is utilized as input to the agent. Then, the agent parses the input meeting text to extract its various components such as title, body, participants, location, time etc.
- the system also performs pattern matching to identify particular meeting fields in a meeting text.
- This information is utilized to query various sources of information on the web and obtain relevant stories about the current meeting to send back to the calendaring system. For example, if an individual has a meeting with Netscape and Microsoft to talk about their disputes, and would obtain this initial information from the calendaring system.
- the system queries the web for relevant information concerning the topic.
- the system updates the calendaring system and eventually the user with the best information it can gather to prepare the user for the target meeting.
- the information is stored in a file that is obtained via selection from a link imbedded in the calendar system.
- a computer program in accordance with an embodiment of the present invention is organized in five distinct modules: BF.Main, BF.Parse, Background Finder.Error, BF.PatternMatching and
- Microsoft's Winsock control An embodiment of the system executes in two different modes which can be specified under the command line sent to it by the calendaring system. When the system runs in simple mode, it executes a keyword query to submit to external search engines.
- the system When executed in complex mode, the system performs pattern matching before it forms a query to be sent to a search engine.
- the system in accordance with an embodiment of the present invention utilizes three user defined structures: TMeetingRecord;
- the user-defined structure, tMeetingRecord is used to store all the pertinent information concerning a single meeting.
- This info includes userlD, an original description of the meeting, the extracted list of keywords from the title and body of meeting etc. It is important to note that only one meeting record is created per instance of the system in accordance with an embodiment of the present invention. This is because each time the system is spawned to service an upcoming meeting, it is assigned a task to retrieve information for only one meeting. Therefore, the meeting record created corresponds to the current meeting examined. ParseMeetingText populates this meeting record and it is then passed around to provide information about the meeting to other functions.
- the record tAPatternRecord is an array containing all the components / elements of a pattern.
- the type tAPattemElement is an array of strings which represent an element in a pattern. Because there may be many "substitutes" for each element, we need an array of strings to keep track of what all the substitutes are.
- the structures of tAPattemElement and tAPatternRecord are presented below in accordance with an embodiment of the present invention.
- Figure 20 depicts the overall process flow in accordance with an embodiment of the present invention. Processing commences at the top of the chart at function block 2000 which launches when the program starts. Once the application is started, the command line is parsed to remove the appropriate meeting text to initiate the target of the background find operation in accordance with an embodiment of the present invention as shown in function block 2010. A global stop list is generated after the target is determined as shown in function block 2020. Then, all the patterns that are utilized for matching operations are generated as illustrated in function block 2030. Then, by tracing through the chart, function block
- GoBF 2040 which is responsible for logical processing associated with wrapping the correct search query information for the particular target search engine.
- function block 2040 flows to function block 2050 and it then calls GoPattemMatch as shown in function block 2060.
- GoPattemMatch we swap to the diagram titled "Process Flow for BF's Pattern Matching Unit.”
- the Alta Vista search engine utilizes the identifies and returns general information about topics related to the current meeting as shown in function block 270 of Figure 2.
- the system in accordance with an embodiment of the present invention takes all the keywords from the title portion of the original meeting text and constructs an advanced query to send to Alta Vista.
- the keywords are logically combined together in the query.
- the results are also ranked based on the same set of keywords.
- a date restriction or publisher criteria could be facilitated on the articles we want to retrieve.
- a set of top ranking stories are returned to the calendaring system in accordance with an embodiment of the present invention.
- the NewsPage search system is responsible for giving us the latest news topics related to a target meeting.
- the system takes all of the keywords from the title portion of the original meeting text and constructs a query to send to the NewsPage search engine.
- the keywords are logically combined together in the query. Only articles published recently are retrieved.
- the NewsPage search system provides a date restriction criteria that is settable by a user according to the user's preference. The top ranking stories are returned to the calendaring system.
- Figure 21 is a user profile data model in accordance with an embodiment of the present invention.
- Processing commences at function block 2100 which is responsible for invoking the program from the main module.
- function block 2110 a wrapper function is invoked to prepare for the keyword extraction processing in function block 2120.
- function block 2130 determines if the delimiters are properly positioned.
- function block 2140 the number of words in a particular string is calculated and the delimiters for the particular field are and a particular field from the meeting text is retrieved at function block 2150.
- the delimiters of the string are again checked to assure they are placed appropriately.
- function block 2160 the extraction of each word from the title and body of the message is performed a word at a time utilizing the logic in function block 2162 which finds the next closest word delimiter in the input phrase, function block 21 4 which strips unnecessary materials from a word and function block 2166 which determines if a word is on the stop list and returns an error if the word is on the stop list.
- Pattern Matching Limitations associated with a simple searching method include: 1. Because it relies on a stop list of unwanted words in order to extract from the meeting text a set of keywords, it is limited by how comprehensive the stop list is. Instead of trying to figure out what parts of the meeting text we should throw away, we should focus on what parts of the meeting text we want. 2. A simple search method in accordance with an embodiment of the present invention only uses the keywords from a meeting title to form queries to send to Alta Vista and
- Pattern matching is based on a set of templates that we specify, allowing us to identify people names, company names etc from a meeting text.
- a pattern in the context of an embodiment of the present invention is a template specifying the structure of a phrase we are looking for in a meeting text.
- the patterns supported by an embodiment of the present invention are selected because they are templates of phrases which have a high probability of appearing in someone's meeting text. For example, when entering a meeting in a calendar, many would write something such as "Meet with Bob Dutton from
- Terminology associated with pattern matching includes:
- Pattern a pattern is a template specifying the structure of a phrase we want to bind the meeting text to. It contains sub units.
- Element a pattern can contain many sub-units. These subunits are called elements. For example, in the pattern “with $PEOPLE$ from $COMPANY$”, “with” "SPEOPLES”
- Placeholder a placeholder is a special kind of element in which we want to bind a value to. Using the above example, "SPEOPLES" is a placeholder.
- Indicator an indicator is another kind of element which we want to find in a meeting text but no value needs to bind to it. There may be often more than one indicator we are looking for in a certain pattern. That is why an indicator is not an "atomic" type.
- substitutes are a set of indicators which are all synonyms of each other. Finding any one of them in the input is good.
- parentheses are the illustrative placeholders used in the code as representation of the corresponding meeting fields.
- Each placeholder may have the following meaning: SCOMPANYS: binds a string of capitalized words (e.g., Meet with Joe Carter of
- SLOCATIONS binds a string of capitalized words (e.g., Meet Susan at ⁇ Palo Alto
- STIMES binds a string containing the format #:## (e.g., Dinner at ⁇ 6:30 pm>)
- $TOPIC_UPPER$ binds a string of capitalized words for our topic (e.g., ⁇ Stanford
- STOP AALLS binds a string of words without really caring if it's capitalized or not.
- the following table represents patterns supported by BF.
- Each pattern belongs to a pattern group. All patterns within a pattern group share a similar format and they only differ from each other in terms of what indicators are used as substitutes. Note that the pattems which are grayed out are also commented in the code. BF has the capability to support these patterns but we decided that matching these patterns is not essential at this point.
- Figure 22 is a detailed flowchart of pattern matching in accordance with an embodiment of the present invention. Processing commences at function block 2200 where the main program invokes the pattern matching application and passes control to function block 2210 to commence the pattern match processing. Then, at function block 2220, the wrapper function loops through to process each pattern which includes determining if a part of the text string can be bound to a pattern as shown in function block 2230. Then, at function block 2240, various placeholders are bound to values if they exist, and in function block 2241, a list of names separated by punctuation are bound, and at function block 2242 a full name is processed by finding two capitalized words as a full name and grabbing the next letter after a space after a word to determine if it is capitalized.
- function block 2243 time is parsed out of the string in an appropriate manner and the next word after a blank space in function block 2244.
- function block 2245 the continuous phrases of capitalized words such as company, topic or location are bound and in function block 2246, the next word after the blank is obtained for further processing in accordance with an embodiment of the present invention.
- function block 2250 is utilized to locate an indicator which is the head of a pattern, the next word after the blank is obtained as shown in function block 2252 and the word is checked to determine if the word is an indicator as shown in function block
- the string is parsed to locate an indicator which is not at the end of the pattern and the next word after unnecessary white space such as that following a line feed or a carriage return is processed as shown in function block 2262 and the word is analyzed to determine if it is an indicator as shown in function block 2264.
- the temporary record is reset to the null set to prepare it for processing the next string and at function block 2280, the meeting record is updated and at function block 2282 a check is performed to determine if an entry is already made to the meeting record before parsing the meeting record again.
- FIG. 23 shows a flowchart of the detailed processing for preparing a query and obtaining information from the Internet in accordance with an embodiment of the present invention. Processing commences at function block 2300 and immediately flows to function block 2310 to process the wrapper functionality to prepare for an Internet search utilizing a web search engine.
- Alta Vista Search Engine If the search is to utilize the Alta Vista search engine, then at function block 2330, the system takes information from the meeting record and forms a query in function blocks 2340 to 2360 for submittal to the search engine. If the search is to utilize the NewsPage search engine, then at function block 2320, the system takes information from the meeting record and forms a query in function blocks 2321 to 2328.
- Alta Vista Search Engine :
- Alta Vista search engine provides enhanced flexibility. Using its advance query method, one can construct all sorts of Boolean queries and rank the search however you want. However, one of the biggest drawbacks with Alta Vista is that it is not very good at handling a large query and is likely to give back irrelevant results. If we can identify the topic and the company within a meeting text, we can form a pretty short but comprehensive query which will hopefully yield better results. We also want to focus on the topics found. It may not be of much merit to the user to find out info about a company especially if the user already knows the company well and has had numerous meetings with them. It's the topics they want to research on.
- a strength of the News Page search engine is that it does a great job searching for the most recent news if you are able to give it a valid company name. Therefore when we submit a query to the news page web site, we send whatever company name we can identify and only if we cannot find one do we use the topics found to form a query. If neither one is found, then no search is performed.
- the algorithm utilized to form the query to submit to Alta Vista is illustrated in Figure 25.
- the algorithm that we will use to form the query to submit to News Page is illustrated in Figure 26.
- Procedure Type Called By Description Name stop words as expected because stop words are words usually common to all meeting text so it makes sense they form patterns.
- the second type of elements are special strings which represent placeholders.
- a placeholder is always in the form of $*$ where * can be either PEOPLE, COMPANY, TOPIC JJPPER, TEVIE,LOCATION or TOPIC_ALL.
- a pattern can begin with either one of the two types of elements and can be however long, involving however any number/type of elements. This procedure dynamically creates a new pattern record for each pattern in the table and it also dynamically creates new tAPatternElements for each element within a pattern. In addition, there is the concept of being able to substitute indicators within a pattern. For example, the pattern SPEOPLES of
- Figure 24 shows a flowchart of the actual code utilized to prepare and submit searches to the Alta Vista and NewsPage search engines in accordance with an embodiment of the present invention.
- Processing commences at function block 2401 where a command line is utilized to update a calendar entry with specific calendar information.
- the message is next posted in accordance with function block 2402 and a meeting record is created to store the current meeting information in accordance with function block 2403.
- function block 2404 the query is submitted to the Alta Vista search engine and in function block 2405, the query is submitted to the NewsPage search engine.
- a message is returned from the search engine, it is stored in a results data stmcture as shown in function block 2406 and the information is processed and stored in summary form in a file for use in preparation for the meeting as detailed in function block 2407.
- Figure 25 provides more detail on creating the query in accordance with an embodiment of the present invention. Processing commences at function block 2505 where the meeting record is parsed to obtain potential companies, people, topics, location and a time. Then, in function block 2506, at least one topic is identified and in function block 2507, at least one company name is identified and finally in function block 2508, a decision is made on what material to transmit to the file for ultimate consumption by the user.
- Figure 26 is a variation on the query theme presented in Figure 25.
- a meeting record is parsed in function block 2600, a company is identified in function block 2620, a topic is identified in function block 2630 and finally in function block 2640 the topic and or the company is utilized in formulating the query.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Theoretical Computer Science (AREA)
- Public Health (AREA)
- Epidemiology (AREA)
- Physics & Mathematics (AREA)
- Primary Health Care (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Mathematical Physics (AREA)
- Pharmacology & Pharmacy (AREA)
- Medicinal Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Toxicology (AREA)
- Biomedical Technology (AREA)
- Software Systems (AREA)
- Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- User Interface Of Digital Computer (AREA)
- Stored Programmes (AREA)
- Computer And Data Communications (AREA)
- Information Transfer Between Computers (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP01914581A EP1402397A2 (en) | 2000-03-08 | 2001-02-28 | Method for a knowledge model |
CA2404084A CA2404084C (en) | 2000-03-08 | 2001-02-28 | Method for a knowledge model |
AU2001239950A AU2001239950B2 (en) | 2000-03-08 | 2001-02-28 | Method for a knowledge model |
AU3995001A AU3995001A (en) | 2000-03-08 | 2001-03-02 | Method for a knowledge model |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US52054300A | 2000-03-08 | 2000-03-08 | |
US52094200A | 2000-03-08 | 2000-03-08 | |
US09/520,942 | 2000-03-08 | ||
US09/520,543 | 2000-03-08 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2001067280A2 true WO2001067280A2 (en) | 2001-09-13 |
WO2001067280A3 WO2001067280A3 (en) | 2003-12-24 |
Family
ID=27060177
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2001/006457 WO2001067280A2 (en) | 2000-03-08 | 2001-02-28 | Method for a knowledge model |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1402397A2 (en) |
AU (2) | AU2001239950B2 (en) |
CA (1) | CA2404084C (en) |
WO (1) | WO2001067280A2 (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002021336A2 (en) * | 2000-09-06 | 2002-03-14 | Cellomics, Inc. | Method and system for interpreting and validating experimental data with automated reasoning |
EP1825355A2 (en) * | 2004-11-12 | 2007-08-29 | Bobick Mark | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
EP1990761A1 (en) * | 2001-12-20 | 2008-11-12 | Accenture Global Services GmbH | Business transaction management |
US8024653B2 (en) | 2005-11-14 | 2011-09-20 | Make Sence, Inc. | Techniques for creating computer generated notes |
US8126890B2 (en) | 2004-12-21 | 2012-02-28 | Make Sence, Inc. | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
US8140559B2 (en) | 2005-06-27 | 2012-03-20 | Make Sence, Inc. | Knowledge correlation search engine |
US8249903B2 (en) | 2006-10-10 | 2012-08-21 | Accenture Global Services Limited | Method and system of determining and evaluating a business relationship network for forming business relationships |
US8898134B2 (en) | 2005-06-27 | 2014-11-25 | Make Sence, Inc. | Method for ranking resources using node pool |
US9330175B2 (en) | 2004-11-12 | 2016-05-03 | Make Sence, Inc. | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5506984A (en) * | 1993-06-30 | 1996-04-09 | Digital Equipment Corporation | Method and system for data retrieval in a distributed system using linked location references on a plurality of nodes |
WO1997038376A2 (en) * | 1996-04-04 | 1997-10-16 | Flair Technologies, Ltd. | A system, software and method for locating information in a collection of text-based information sources |
-
2001
- 2001-02-28 AU AU2001239950A patent/AU2001239950B2/en not_active Expired
- 2001-02-28 WO PCT/US2001/006457 patent/WO2001067280A2/en not_active Application Discontinuation
- 2001-02-28 CA CA2404084A patent/CA2404084C/en not_active Expired - Lifetime
- 2001-02-28 EP EP01914581A patent/EP1402397A2/en not_active Ceased
- 2001-03-02 AU AU3995001A patent/AU3995001A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5506984A (en) * | 1993-06-30 | 1996-04-09 | Digital Equipment Corporation | Method and system for data retrieval in a distributed system using linked location references on a plurality of nodes |
WO1997038376A2 (en) * | 1996-04-04 | 1997-10-16 | Flair Technologies, Ltd. | A system, software and method for locating information in a collection of text-based information sources |
Non-Patent Citations (3)
Title |
---|
LEE M ET AL: "A knowledge acquisition framework for an intelligent decision-support system" INTELLIGENT INFORMATION SYSTEMS,1994. PROCEEDINGS OF THE 1994 SECOND AUSTRALIAN AND NEW ZEALAND CONFERENCE ON BRISBANE, QLD., AUSTRALIA 29 NOV.-2 DEC. 1994, NEW YORK, NY, USA,IEEE, 29 November 1994 (1994-11-29), pages 432-436, XP010136758 ISBN: 0-7803-2404-8 * |
NADA LAVRAC & IGOR MOZETIC: "Second Generation Knowledge Acquisition Methods and their Application to Medicine" AUSTRIAN RESEARCH INSTIUTE FOR AI, [Online] 1992 - 1992, pages 1-21, XP002255124 Vienna Retrieved from the Internet: <URL:www.citeseer.com> [retrieved on 2003-09-19] * |
VOSS A ET AL: "Concept indexing" GROUP'99. PROCEEDINGS OF THE INTERNATIONAL ACM SIGGROUP CONFERENCE ON SUPPORTING GROUP WORK, PROCEEDINGS OF GROUP 99: CONFERENCE ON SUPPORTING GROUP WORK, PHOENIX, AZ, USA, 14-17 NOV. 1999, pages 1-10, XP002255123 1999, New York, NY, USA, ACM, USA ISBN: 1-58113-065-1 * |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002021336A3 (en) * | 2000-09-06 | 2003-07-31 | Cellomics Inc | Method and system for interpreting and validating experimental data with automated reasoning |
WO2002021336A2 (en) * | 2000-09-06 | 2002-03-14 | Cellomics, Inc. | Method and system for interpreting and validating experimental data with automated reasoning |
EP1990761A1 (en) * | 2001-12-20 | 2008-11-12 | Accenture Global Services GmbH | Business transaction management |
US8046238B2 (en) | 2001-12-20 | 2011-10-25 | Accenture Global Services Limited | Business transaction management |
US9311601B2 (en) | 2004-11-12 | 2016-04-12 | Make Sence, Inc. | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
EP1825355A2 (en) * | 2004-11-12 | 2007-08-29 | Bobick Mark | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
EP1825355A4 (en) * | 2004-11-12 | 2009-11-25 | Make Sence Inc | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
US10467297B2 (en) | 2004-11-12 | 2019-11-05 | Make Sence, Inc. | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
US8108389B2 (en) | 2004-11-12 | 2012-01-31 | Make Sence, Inc. | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
US9330175B2 (en) | 2004-11-12 | 2016-05-03 | Make Sence, Inc. | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
US8126890B2 (en) | 2004-12-21 | 2012-02-28 | Make Sence, Inc. | Techniques for knowledge discovery by constructing knowledge correlations using concepts or terms |
US8898134B2 (en) | 2005-06-27 | 2014-11-25 | Make Sence, Inc. | Method for ranking resources using node pool |
US8140559B2 (en) | 2005-06-27 | 2012-03-20 | Make Sence, Inc. | Knowledge correlation search engine |
US9477766B2 (en) | 2005-06-27 | 2016-10-25 | Make Sence, Inc. | Method for ranking resources using node pool |
US9213689B2 (en) | 2005-11-14 | 2015-12-15 | Make Sence, Inc. | Techniques for creating computer generated notes |
US8024653B2 (en) | 2005-11-14 | 2011-09-20 | Make Sence, Inc. | Techniques for creating computer generated notes |
US8249903B2 (en) | 2006-10-10 | 2012-08-21 | Accenture Global Services Limited | Method and system of determining and evaluating a business relationship network for forming business relationships |
Also Published As
Publication number | Publication date |
---|---|
EP1402397A2 (en) | 2004-03-31 |
CA2404084C (en) | 2011-02-22 |
CA2404084A1 (en) | 2001-09-13 |
AU3995001A (en) | 2001-09-17 |
WO2001067280A3 (en) | 2003-12-24 |
AU2001239950B2 (en) | 2005-03-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6957205B1 (en) | Knowledge model-based indexing of information | |
US7171448B1 (en) | Conducting activities in a collaborative work tool architecture | |
US7082430B1 (en) | Collaboration planning in a collaborative work tool architecture | |
US7076736B2 (en) | Method and apparatus for sharing many thought databases among many clients | |
Karger et al. | Haystack: A customizable general-purpose information management tool for end users of semistructured data | |
US8370331B2 (en) | Dynamic visualization of search results on a graphical user interface | |
JP4312954B2 (en) | Information management system | |
US6446076B1 (en) | Voice interactive web-based agent system responsive to a user location for prioritizing and formatting information | |
US20030227487A1 (en) | Method and apparatus for creating and accessing associative data structures under a shared model of categories, rules, triggers and data relationship permissions | |
US20050071305A1 (en) | System, method and article of manufacture for a knowledge model | |
US7152054B2 (en) | Context-based help engine, dynamic help, and help architecture | |
EP1003110A2 (en) | Property-based user level document management | |
CA2795540A1 (en) | A system, method and article of manufacture for advanced information gathering for targetted activities | |
EP1415245A2 (en) | A method for a graphical user interface search filter generator | |
US8600982B2 (en) | Providing relevant information based on data space activity items | |
CA2404084C (en) | Method for a knowledge model | |
AU2001239950A1 (en) | Method for a knowledge model | |
Baldonado | A user‐centered interface for information exploration in a heterogeneous digital library | |
CA2350314C (en) | A system, method and article of manufacture for effectively interacting with a network user | |
US20090187541A1 (en) | Computer method and system for contextual management and awareness of persistent queries and results | |
US20050015486A1 (en) | System, method and article of manufacture for organization monitoring | |
AU2001253587B2 (en) | Method for a collaborative work tool environment | |
Lin et al. | Building a topic map repository | |
AU2007202886B2 (en) | Method for a collaborative work tool environment | |
Zoller et al. | WEBCON: a toolkit for an automatic, data dictionary based connection of databases to the WWW |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM HR HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) |
Free format text: (EXCEPT CO) |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2404084 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2001239950 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2001914581 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2001914581 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: JP |
|
WWG | Wipo information: grant in national office |
Ref document number: 2001239950 Country of ref document: AU |
|
WWR | Wipo information: refused in national office |
Ref document number: 2001914581 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2001914581 Country of ref document: EP |