US20150074565A1 - Interfaces for providing enhanced connection data for shared resources - Google Patents
Interfaces for providing enhanced connection data for shared resources Download PDFInfo
- Publication number
- US20150074565A1 US20150074565A1 US14/082,481 US201314082481A US2015074565A1 US 20150074565 A1 US20150074565 A1 US 20150074565A1 US 201314082481 A US201314082481 A US 201314082481A US 2015074565 A1 US2015074565 A1 US 2015074565A1
- Authority
- US
- United States
- Prior art keywords
- data
- metadata
- data sources
- computer system
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/248—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04842—Selection of displayed objects or displayed text elements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3325—Reformulation based on results of preceding query
- G06F16/3326—Reformulation based on results of preceding query using relevance feedback from the user, e.g. relevance feedback on documents, documents sets, document terms or passages
- G06F16/3328—Reformulation based on results of preceding query using relevance feedback from the user, e.g. relevance feedback on documents, documents sets, document terms or passages using graphical result space presentation or visualisation
Definitions
- Computer databases have long been used to store and provide access to data. Computer databases allow users to enter queries and receive results that are intended to satisfy the query. In many cases, the user's query may be answered by data that is stored in multiple different databases. The user may not know that the data they desire to access is stored on these databases, and, even if the user is aware of the data's location, the user may not know how to access the data. This data is often referred to as “dark data”. The dark data is data that is required (or is at least useful) in answering the user's query, but is not accessible by the user.
- Embodiments described herein are directed to generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources.
- a computer system generates a UI that shows an indication of data elements to which a metadata catalog is linked.
- the metadata catalog includes references to different data sources that store data elements.
- the UI receives an input indicating that metadata for at least one of the data sources is to be modified.
- the modification includes providing additional metadata for a specified data source, providing documentation for a specified data source and/or providing an entry point for an access request workflow.
- the computer system performs the received data source metadata modification according to the received input and, upon determining that the metadata has been modified, displays the updated metadata for the data sources in the UI.
- FIG. 1 illustrates a computer architecture in which embodiments described herein may operate including generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources.
- FIG. 2 illustrates a flowchart of an example method for generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources.
- FIG. 3 illustrates example interactions between a data manager portal and a metadata catalog.
- Embodiments described herein are directed to generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources.
- a computer system generates a UI that shows an indication of data elements to which a metadata catalog is linked.
- the metadata catalog includes references to different data sources that store data elements.
- the UI receives an input indicating that metadata for at least one of the data sources is to be modified.
- the modification includes providing additional metadata for a specified data source, providing documentation for a specified data source and/or providing an entry point for an access request workflow.
- the computer system performs the received data source metadata modification according to the received input and, upon determining that the metadata has been modified, displays the updated metadata for the data sources in the UI.
- Computing systems are now increasingly taking a wide variety of forms. Computing systems may, for example, be handheld devices, appliances, laptop computers, desktop computers, mainframes, distributed computing systems, or even devices that have not conventionally been considered a computing system.
- the term “computing system” is defined broadly as including any device or system (or combination thereof) that includes at least one physical and tangible processor, and a physical and tangible memory capable of having thereon computer-executable instructions that may be executed by the processor.
- a computing system may be distributed over a network environment and may include multiple constituent computing systems.
- a computing system 101 A typically includes at least one processing unit 102 A and memory 103 A.
- the memory 103 may be physical system memory, which may be volatile, non-volatile, or some combination of the two.
- the term “memory” may also be used herein to refer to non-volatile mass storage such as physical storage media. If the computing system is distributed, the processing, memory and/or storage capability may be distributed as well.
- executable module can refer to software objects, routings, or methods that may be executed on the computing system.
- the different components, modules, engines, and services described herein may be implemented as objects or processes that execute on the computing system (e.g., as separate threads).
- embodiments are described with reference to acts that are performed by one or more computing systems. If such acts are implemented in software, one or more processors of the associated computing system that performs the act direct the operation of the computing system in response to having executed computer-executable instructions.
- such computer-executable instructions may be embodied on one or more computer-readable media that form a computer program product.
- An example of such an operation involves the manipulation of data.
- the computer-executable instructions (and the manipulated data) may be stored in the memory 103 A of the computing system 101 A.
- Computing system 101 A may also contain communication channels that allow the computing system 101 A to communicate with other message processors over a wired or wireless network.
- Embodiments described herein may comprise or utilize a special-purpose or general-purpose computer system that includes computer hardware, such as, for example, one or more processors and system memory, as discussed in greater detail below.
- the system memory may be included within the overall memory 103 A.
- the system memory may also be referred to as “main memory”, and includes memory locations that are addressable by the at least one processing unit 102 A over a memory bus in which case the address location is asserted on the memory bus itself.
- System memory has been traditional volatile, but the principles described herein also apply in circumstances in which the system memory is partially, or even fully, non-volatile.
- Embodiments within the scope of the present invention also include physical and other computer-readable media for carrying or storing computer-executable instructions and/or data structures.
- Such computer-readable media can be any available media that can be accessed by a general-purpose or special-purpose computer system.
- Computer-readable media that store computer-executable instructions and/or data structures are computer storage media.
- Computer-readable media that carry computer-executable instructions and/or data structures are transmission media.
- embodiments of the invention can comprise at least two distinctly different kinds of computer-readable media: computer storage media and transmission media.
- Computer storage media are physical hardware storage media that store computer-executable instructions and/or data structures.
- Physical hardware storage media include computer hardware, such as RAM, ROM, EEPROM, solid state drives (“SSDs”), flash memory, phase-change memory (“PCM”), optical disk storage, magnetic disk storage or other magnetic storage devices, or any other hardware storage device(s) which can be used to store program code in the form of computer-executable instructions or data structures, which can be accessed and executed by a general-purpose or special-purpose computer system to implement the disclosed functionality of the invention.
- Transmission media can include a network and/or data links which can be used to carry program code in the form of computer-executable instructions or data structures, and which can be accessed by a general-purpose or special-purpose computer system.
- a “network” is defined as one or more data links that enable the transport of electronic data between computer systems and/or modules and/or other electronic devices.
- program code in the form of computer-executable instructions or data structures can be transferred automatically from transmission media to computer storage media (or vice versa).
- program code in the form of computer-executable instructions or data structures received over a network or data link can be buffered in RAM within a network interface module (e.g., a “NIC”), and then eventually transferred to computer system RAM and/or to less volatile computer storage media at a computer system.
- a network interface module e.g., a “NIC”
- computer storage media can be included in computer system components that also (or even primarily) utilize transmission media.
- Computer-executable instructions comprise, for example, instructions and data which, when executed at one or more processors, cause a general-purpose computer system, special-purpose computer system, or special-purpose processing device to perform a certain function or group of functions.
- Computer-executable instructions may be, for example, binaries, intermediate format instructions such as assembly language, or even source code.
- a computer system may include a plurality of constituent computer systems.
- program modules may be located in both local and remote memory storage devices.
- Cloud computing environments may be distributed, although this is not required. When distributed, cloud computing environments may be distributed internationally within an organization and/or have components possessed across multiple organizations.
- cloud computing is defined as a model for enabling on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services). The definition of “cloud computing” is not limited to any of the other numerous advantages that can be obtained from such a model when properly deployed.
- system architectures described herein can include a plurality of independent components that each contribute to the functionality of the system as a whole.
- This modularity allows for increased flexibility when approaching issues of platform scalability and, to this end, provides a variety of advantages.
- System complexity and growth can be managed more easily through the use of smaller-scale parts with limited functional scope.
- Platform fault tolerance is enhanced through the use of these loosely coupled modules.
- Individual components can be grown incrementally as business needs dictate. Modular development also translates to decreased time to market for new functionality. New functionality can be added or subtracted without impacting the core system.
- FIG. 1 illustrates a computer architecture 100 in which at least one embodiment may be employed.
- Computer architecture 100 includes computer systems 101 A and 101 B.
- Computer systems 101 A and 101 B may each be any type of local or distributed computer system, including cloud computing systems.
- the computer systems includes various modules for performing different functions. For instance, communications modules 104 A/ 104 B are configured to communicate with other computer systems, typically via wired or wireless connections.
- the communications modules may, for example, receive inputs from users such as user 105 .
- the communication modules 104 A/ 104 B may also transmit data to and from each other, to and from the various data sources 125 , and to and from other computer systems not shown in FIG. 1 .
- the data sources 125 A-C may be any type of data stores including local data stores, distributed data stores, storage area networks (SANs) or other systems or means for storing data.
- Each data source 125 stores data elements 124 A-C.
- the data elements may be any portion of data including individual data files or groups of files, database columns or rows or other data structures.
- the data sources may be local or remote to first computer system 101 A and/or second computer system 101 B. Both the first and second computer systems may have access to all or portions of the data sources 125 , depending on the current computer user's data access rights.
- user 105 may wish to perform a query against one or more of the data sources 125 .
- the user may input search criteria 107 (on input 106 ) which are used by data query generating module 110 to generate a query 115 .
- the generated query 115 includes the user-specified search criteria 107 .
- the query is transmitted from the first computer system 101 A to the second computer system 101 B.
- the data query processing module 118 processes the data query 115 , accessing one or more of the data sources to satisfy the query.
- the data query processing module may determine that additional data elements relevant to the search exist, but are not currently accessible to the user. For example, the additional data elements may be secured behind a data access wall that prevents unauthorized access.
- the user may, however, be able to access these data elements upon proper authentication.
- the second computer system 101 B may send back any query results 116 that were found within data elements to which the user has access, and may further include an indication 117 of those additional data elements that are relevant to the user-provided search criteria 107 , but not currently accessible to the user.
- the indication may further indicate one or more steps the user is to take in order to access the additional data.
- connection dialog box or similar user experience
- the system described herein will display an option for the user to view data source documentation 122 , and may further display an option to initiate a defined business process (e.g. workflow entry point 123 ) to request access to the data source 125 .
- a defined business process e.g. workflow entry point 123
- a metadata catalog 119 may be provided that stores metadata 121 about data sources 125 and other objects.
- the metadata catalog may be a service, including a local service or a cloud-based service.
- the data maintained by the metadata catalog may include data source locations (e.g. data element links 120 ), business documentation 122 (along with other information about the data sources), and may include access workflow entry points 123 .
- the access workflow entry points 123 may allow the user 105 to begin a workflow that will allow them to access data that was previously inaccessible (e.g. by providing proper credentials).
- Embodiments herein also consolidate lineage (i.e. relationship) information for the various data artifacts and elements that are stored. With this information, the system can surface related and relevant information to the user.
- the system can look back in lineage information to infer the request access information.
- the system implements containers such that it understands and can interpret an element that is hosted within another element, and can propagate metadata within containers and for related data sources in this manner.
- the system can look at the data element's container (the database server) and infer metadata from the container, or from other related data sources.
- metadata 121 provided by multiple items in the lineage or container chain may be blended together to provide the end-user a full 360 degree view of the information that is available and that they are interacting with.
- the system also supports multiple automatic algorithms to infer additional metadata (certification, column names, lineage, container relationships, descriptions, etc.) by looking at the structure of the data, the structure of metadata that surrounds the current set of metadata (e.g. similar columns in the same data source), metadata that has been provided earlier by the same author or other authors that are interacting with the system, or through other means.
- the system can automatically look at the lineage of data to understand whether a current data artifact should be certified, what categorization and classification should be applied and then apply and/or certify it. If requested, the system may provide information on how the attributes were gathered and display that information to the user.
- Annotations and metadata for data sources can also be provided in social form.
- the metadata catalog run as a service within the cloud (indeed, the metadata catalog may run locally, distributed or in a federated system), multiple people that interact with the service can provide annotations (social annotations/metadata). These are then applied or, in some cases, first approved by a user, a quorum of users (e.g. people that are endorsing or using the data source(s)), or by a central authority.
- the system can also learn from this and automatically approve annotations in the future. As such, the system can learn from all the input that is provided, and can automatically identify types of data (columns, tables, datasets) by observing how authors provide this information manually and then suggesting best matches in the future.
- a client application experience allows users to share data objects.
- the client application (which may run locally, distributed or on a federated system) may send data source metadata, including data storage locations and connection details, to the cloud service.
- the client application may also be used to provide and surface metadata.
- the client application further interacts with the metadata catalog service 119 to provide the interface where the service would expose the supplemental data to the end-user.
- the client application can also be an application that sits in the cloud or on the machine of the user 105 (e.g. computer system 101 A).
- a stewardship application may also be provided (which may be different than, or part of the client application) that allows authorized users to modify or annotate data sources with additional metadata (e.g. data manager portal 301 of FIG. 3 ).
- the authorized users may modify or annotate data sources by adding documentation 122 and the entry point (such as a URL or email address) for a data access business workflow 123 .
- a data consumption/analysis application experience may be provided that retrieves data source metadata from the cloud service and presents it to the user 105 in the context of connecting to a secure data source.
- the list of known data sources 304 may be provided in the data manager portal 301 , and exposed to authorized data managers.
- authorized data managers may refer to administrators, stewards, data experts or other users that are authorized to make changes 302 to the metadata catalog 303 .
- company data managers, curators or owners may provide metadata 121 for data sources 125 , supplementing any existing metadata (such as location and type) with business- or user-friendly names and descriptions, and further provide an entry point 123 for a business process to request access to the data source.
- This entry point may be an email address, a web page URL, or any URI-addressable endpoint.
- This manager-provided metadata is stored in the metadata catalog 119 / 503 .
- connection experience will display the friendly name and business description for the data source, along with a link to request access to the data. Users who do not have the requisite permissions to access to the data source can select this link to initiate the request access workflow.
- the connection experience (both the discovery of data sources and the display of metadata and access link) can be presented in any client application that communicates with the metadata catalog, including standalone client applications and clients that are built in to other applications or services.
- a computer system performs a method for providing access to supplemental query data that is relevant to a data query.
- the computer system receives an input indicating that a data query is to be processed, the input providing one or more search criteria.
- the communications module 104 A of computer system 101 A may receive input 106 indicating that a query is to be processed.
- the input may provide various different search terms and other search criteria 107 for the search.
- the data query generating module 110 may then generate a data query 115 according to the search criteria in the received input.
- the communications module 104 A may then send the data query to the second computer system 101 B which is configured to process the data query.
- the second computer system 101 B includes a data query processing module 118 which processes the query 115 according to the search criteria 107 specified by the user 105 .
- the first computer system 101 A receives the results 116 of the data query.
- the results indicate that one or more additional data elements relevant to the search criteria have been identified.
- the data elements may be stored in a data store to which the user does not currently have access.
- the query may be processed using data elements from multiple data sources 125 , one or more of these data sources may be behind a paywall or behind some other sort of wall that prevents unauthorized access.
- These additional data elements (or an indication thereof) may be displayed to the user 105 .
- the display may thus include an indication of available data elements, and further include various steps that are to be taken in order to access the identified additional data elements.
- the user may send a data access input 108 to the first computer system 101 A.
- the data access input 108 may correspond to at least one of the steps provided in the indication of additional data elements. For example, if one of the data access steps says to provide a user name and password, or to click a button to initiate a data access workflow, or to provide an email address or other identifier, or to perform some other step, the user may do so in order to access the additional data elements.
- the user's input may be received at a user interface 111 at the first computer system 101 A, or via some other means. Once the input is received, the computer system 101 A may perform the steps indicated by the user's data access input 108 .
- the data elements may be accessed (either directly by computer system 101 A, or in some cases, the second computer system 101 B may access the data elements and send them to the first computer system 101 A) and provided to the user.
- the additional data elements 117 may be displayed in UI 111 next to the original query results 116 .
- the UI 111 may be configured to display the data access steps 112 needed to access the additional data, and may further display the additional data once it is received alongside the existing query results 116 .
- the UI can display a prompt for authentication credentials or other identifiers that would allow the user access to the additional data elements. If subsequent authentication, workflow or other steps are needed, those may also be displayed in the UI 111 .
- the UI may be generated and/or updated dynamically on-demand. Thus, if a user has entered a certain query, and has received certain results, the UI may be dynamically generated to show those results.
- the dynamically generated UI may further generate prompts, buttons or other means for allowing the user to provide inputs in order to access any additional data elements 117 indicated in the query results 116 .
- the UI 111 may further display a data source name for any or all of the additional data elements 117 .
- the source name may be a full name and path, or may be a simplified, user-friendly name that is easier to remember and understand.
- the UI may also display an option to initiate a data access workflow 123 to access resources at the named data source.
- the data access workflow may lead the user 105 through a series of steps that, when completed, provide access to the data elements. In cases where the series of steps includes authentication or other forms of access control, the authentication or access control capabilities may be provided by a third party by another module on computer system 101 A and/or 101 B.
- a data manager may be notified upon determining that a particular data access workflow has been initiated (e.g. when the data manager is the owner of the data). As such, the owner of the data may be apprised of how and when users are accessing the owner's data.
- the data access workflow may access a metadata catalog 119 to determine how to connect to specified data source 125 .
- the metadata catalog 119 is configured to store metadata that maps data elements to data sources. The metadata catalog thereby allows any client application that has access to the metadata catalog, access to the metadata 121 and the stored data elements 124 A-C.
- the metadata catalog will be explained in greater detail below.
- a computer system performs a method for generating and displaying a user interface that allows access to supplemental query data that is relevant to a search query.
- the method includes an act of receiving data query results in response to a data query that includes one or more search criteria, the data query results indicating that one or more additional data elements relevant to the search criteria have been identified, the additional data elements being stored at one or more specified data sources.
- communications module 104 A may receive data query results 116 from the communications module 104 B of computer system 101 B.
- the data query results include those data elements that corresponded to the user's search criteria 107 , and were accessible to the user.
- the data query results 116 may also include an indication that additional data elements 117 are available and are pertinent to the user's search criteria 107 .
- the first computer system 101 A may generate a UI that shows an indication of steps 112 that are to be taken in order to access the identified additional data elements.
- the user 105 may then provide an input 108 which may be received at the UI 111 .
- the input corresponds to at least one of the data access steps 112 that are to be taken to access the identified additional data elements.
- the computer system 101 A may send the received data access input 108 to the specified data sources to retrieve the identified additional data elements.
- a user may provide subscription information to access data that is only available to subscribers (e.g. movies or television shows).
- This subscription data may be sent to the user (either directly from the data sources 125 , or through the second computer system 101 B) and received at the first computer system 101 A.
- the additional data elements (in this example, the subscription data) may be displayed in UI 111 .
- the UI 111 may include a generic interface to request data from multiple different data sources. These data sources may be multiple different data source types. For instance, a data source may be a website, a cloud data provider, a local data store or other type of data source.
- the data query results 116 may include data from any number and/or type of data sources.
- users may share links to data sources, or directly to certain data elements within the data sources.
- a client application 113 may be provided which allows users to share links 120 to data objects stored in the data sources 125 . Additionally or alternatively, the client application may allow users to share data objects directly with other users.
- the client application can look at metadata 121 related to the data sources and data elements, and make determinations as to how best to access the data.
- the client application itself 113 may be a standalone application, or may be built-in as part of another application.
- the client application may monitor user inputs, and may collect data and use heuristics to update internal processes based on the accessed information. For example, if the client application determines that a certain user input is used to request certain data, or that certain requests lead to additional prompts, the client application may learn from these past scenarios, and may prompt the user for certain inputs accordingly.
- FIG. 2 a flowchart of a method 200 is illustrated for generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources.
- the method 200 will now be described with frequent reference to the components and data of environment 100 .
- Method 200 includes an act of generating a user interface (UI) that shows an indication of one or more data elements to which a metadata catalog is linked, the metadata catalog including references to one or more different data sources that store data elements (act 210 ).
- the UI 111 may thus show data elements that are included in the metadata catalog 119 .
- the metadata catalog 119 may link to data sources 125 , as well as individual data elements (e.g. via link 120 ).
- the user 105 may submit an input 109 at UI 111 which indicates that metadata for at least one of the data sources 125 is to be modified (act 220 ).
- the modification includes at least one of the following: providing additional metadata 121 for a specified data source, providing documentation 122 for a specified data source and providing an entry point for an access request workflow 123 .
- a data manager may wish to provide metadata for a given data source.
- This metadata may be shown to users that are requesting data from or are querying that data source. For instance, when a user is requesting data from data source 125 B, and the data manager has provided metadata 121 for that data source (perhaps indicating information about the data source such as its name, its general content, and methods of accessing the data), that metadata will be displayed in the UI along with the query results 116 .
- the results of the modification may be shown in the UI 111 along with the data query results 116 .
- data managers or other users may wish to provide metadata for views, queries or other objects which may be annotated with additional metadata.
- a data manager may provide an input that provides additional metadata for a particular view or query.
- the data manager's input may provide documentation about the view or query, or may provide an entry point for an access request workflow.
- substantially any metadata modifications that can be applied to data sources can similarly be applied to data queries, views or other objects.
- Method 200 further includes an act of performing the received data source metadata modification according to the received input (act 230 ) and, upon determining that the metadata has been modified, the updated metadata 114 for the data sources is displayed in the UI (act 240 ).
- the updated metadata 121 , documentation, workflow entry point 123 or other metadata modification is shown to the user in the dynamically generated UI 111 .
- the UI may further allow user 105 to modify data elements stored on the data sources. The user may make changes to UI elements, and have those changes propagated back to the data stores on which the data elements are stored.
- the UI may allow users to provide feedback regarding the metadata 121 provided by the data manager.
- the feedback provided by the user may be reviewable by the data manager.
- the data manager may make the appropriate changes.
- a user (or group of users) may be able to effect changes in the metadata 121 associated with the data sources.
- an user other than a data manager can verify the changes, such as a verified user, or a quorum of users that have approved a change.
- the metadata may indicate a variety of different things, including indicating, for example, that a data source is an authoritative source (i.e. the master source for certain data elements) among multiple sources.
- each user may be assigned its own metadata repository, and may be able to update and/or control the metadata 121 associated with the data sources.
- usage of the user's metadata repository may be monitored, and then used to improve the original metadata repository.
- the metadata catalog can learn and improve over time, both from the users' feedback and from monitoring the actual usage of the catalog.
- the updated metadata provided by the user 105 may indicate how end users are to access the data source.
- a user may use a metadata manager portal 301 to edit or annotate data sources 302 .
- These annotations are applied within the metadata catalog 303 and are provided when enumerating data sources 304 available to the user 105 and/or to other metadata managers.
- This updated metadata may provide steps that the users are to perform to access the (currently inaccessible) data.
- the steps to be taken may be displayed along with any data query results received from computer system 101 B and/or upon the occurrence of a specified event. For instance, the steps to be taken may be displayed next to, above, below or near the returned query results. Additionally or alternatively, the steps to be taken may be accessed upon the occurrence of an event such as a mouse click, mouseover, touch input or some other event.
- the steps to be taken may be provided by the first computing system 101 A, by the second computing system 101 B, by another computing system (not shown in FIG. 1 ) or by a third party. As such, currently inaccessible data may be accessed by the user after the user performs the necessary steps. In this manner, owners or managers of data sources may provide those steps that are necessary to access the data owned or managed by them. Owners can thus make their data more readily available to those who are authorized to access the data, but still need to take additional steps to access it.
- methods, systems and computer program products which provide access to supplemental query data that is relevant to a data query, and further generate and display user interfaces that allow access to supplemental query data that is relevant to a search query. Moreover, methods, systems and computer program products are provided which generate and display user interfaces that allow data managers to provide supplemental metadata for data sources.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Human Computer Interaction (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
- This application claims priority to and the benefit of U.S. Provisional Patent Application Ser. No. 61/875,582, entitled “Providing Supplemental Connection Data to Access Shared Resources”, filed on Sep. 9, 2013, which application is incorporated by reference in its entirety herein. This application is also related to U.S. patent application Atty. Docket No. 13768.2344.1, entitled “Interfaces for Accessing and Managing Enhanced Connection Data for Shared Resources”, and U.S. patent application Atty. Docket No. 13768.2344.2, entitled “Providing Enhanced Connection Data for Shared Resources”, both of which are filed concurrently herewith. Both related applications are incorporated by reference in their entirety herein.
- Computer databases have long been used to store and provide access to data. Computer databases allow users to enter queries and receive results that are intended to satisfy the query. In many cases, the user's query may be answered by data that is stored in multiple different databases. The user may not know that the data they desire to access is stored on these databases, and, even if the user is aware of the data's location, the user may not know how to access the data. This data is often referred to as “dark data”. The dark data is data that is required (or is at least useful) in answering the user's query, but is not accessible by the user.
- Embodiments described herein are directed to generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources. In one embodiment, a computer system generates a UI that shows an indication of data elements to which a metadata catalog is linked. The metadata catalog includes references to different data sources that store data elements. The UI receives an input indicating that metadata for at least one of the data sources is to be modified. The modification includes providing additional metadata for a specified data source, providing documentation for a specified data source and/or providing an entry point for an access request workflow. The computer system performs the received data source metadata modification according to the received input and, upon determining that the metadata has been modified, displays the updated metadata for the data sources in the UI.
- This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
- Additional features and advantages will be set forth in the description which follows, and in part will be apparent to one of ordinary skill in the art from the description, or may be learned by the practice of the teachings herein. Features and advantages of embodiments described herein may be realized and obtained by means of the instruments and combinations particularly pointed out in the appended claims. Features of the embodiments described herein will become more fully apparent from the following description and appended claims.
- To further clarify the above and other features of the embodiments described herein, a more particular description will be rendered by reference to the appended drawings. It is appreciated that these drawings depict only examples of the embodiments described herein and are therefore not to be considered limiting of its scope. The embodiments will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:
-
FIG. 1 illustrates a computer architecture in which embodiments described herein may operate including generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources. -
FIG. 2 illustrates a flowchart of an example method for generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources. -
FIG. 3 illustrates example interactions between a data manager portal and a metadata catalog. - Embodiments described herein are directed to generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources. In one embodiment, a computer system generates a UI that shows an indication of data elements to which a metadata catalog is linked. The metadata catalog includes references to different data sources that store data elements. The UI receives an input indicating that metadata for at least one of the data sources is to be modified. The modification includes providing additional metadata for a specified data source, providing documentation for a specified data source and/or providing an entry point for an access request workflow. The computer system performs the received data source metadata modification according to the received input and, upon determining that the metadata has been modified, displays the updated metadata for the data sources in the UI.
- The following discussion now refers to a number of methods and method acts that may be performed. It should be noted, that although the method acts may be discussed in a certain order or illustrated in a flow chart as occurring in a particular order, no particular ordering is necessarily required unless specifically stated, or required because an act is dependent on another act being completed prior to the act being performed.
- Computing systems are now increasingly taking a wide variety of forms. Computing systems may, for example, be handheld devices, appliances, laptop computers, desktop computers, mainframes, distributed computing systems, or even devices that have not conventionally been considered a computing system. In this description and in the claims, the term “computing system” is defined broadly as including any device or system (or combination thereof) that includes at least one physical and tangible processor, and a physical and tangible memory capable of having thereon computer-executable instructions that may be executed by the processor. A computing system may be distributed over a network environment and may include multiple constituent computing systems.
- As illustrated in
FIG. 1 , acomputing system 101A typically includes at least oneprocessing unit 102A andmemory 103A. The memory 103 may be physical system memory, which may be volatile, non-volatile, or some combination of the two. The term “memory” may also be used herein to refer to non-volatile mass storage such as physical storage media. If the computing system is distributed, the processing, memory and/or storage capability may be distributed as well. - As used herein, the term “executable module” or “executable component” can refer to software objects, routings, or methods that may be executed on the computing system. The different components, modules, engines, and services described herein may be implemented as objects or processes that execute on the computing system (e.g., as separate threads).
- In the description that follows, embodiments are described with reference to acts that are performed by one or more computing systems. If such acts are implemented in software, one or more processors of the associated computing system that performs the act direct the operation of the computing system in response to having executed computer-executable instructions. For example, such computer-executable instructions may be embodied on one or more computer-readable media that form a computer program product. An example of such an operation involves the manipulation of data. The computer-executable instructions (and the manipulated data) may be stored in the
memory 103A of thecomputing system 101A.Computing system 101A may also contain communication channels that allow thecomputing system 101A to communicate with other message processors over a wired or wireless network. - Embodiments described herein may comprise or utilize a special-purpose or general-purpose computer system that includes computer hardware, such as, for example, one or more processors and system memory, as discussed in greater detail below. The system memory may be included within the
overall memory 103A. The system memory may also be referred to as “main memory”, and includes memory locations that are addressable by the at least oneprocessing unit 102A over a memory bus in which case the address location is asserted on the memory bus itself. System memory has been traditional volatile, but the principles described herein also apply in circumstances in which the system memory is partially, or even fully, non-volatile. - Embodiments within the scope of the present invention also include physical and other computer-readable media for carrying or storing computer-executable instructions and/or data structures. Such computer-readable media can be any available media that can be accessed by a general-purpose or special-purpose computer system. Computer-readable media that store computer-executable instructions and/or data structures are computer storage media. Computer-readable media that carry computer-executable instructions and/or data structures are transmission media. Thus, by way of example, and not limitation, embodiments of the invention can comprise at least two distinctly different kinds of computer-readable media: computer storage media and transmission media.
- Computer storage media are physical hardware storage media that store computer-executable instructions and/or data structures. Physical hardware storage media include computer hardware, such as RAM, ROM, EEPROM, solid state drives (“SSDs”), flash memory, phase-change memory (“PCM”), optical disk storage, magnetic disk storage or other magnetic storage devices, or any other hardware storage device(s) which can be used to store program code in the form of computer-executable instructions or data structures, which can be accessed and executed by a general-purpose or special-purpose computer system to implement the disclosed functionality of the invention.
- Transmission media can include a network and/or data links which can be used to carry program code in the form of computer-executable instructions or data structures, and which can be accessed by a general-purpose or special-purpose computer system. A “network” is defined as one or more data links that enable the transport of electronic data between computer systems and/or modules and/or other electronic devices. When information is transferred or provided over a network or another communications connection (either hardwired, wireless, or a combination of hardwired or wireless) to a computer system, the computer system may view the connection as transmission media. Combinations of the above should also be included within the scope of computer-readable media.
- Further, upon reaching various computer system components, program code in the form of computer-executable instructions or data structures can be transferred automatically from transmission media to computer storage media (or vice versa). For example, computer-executable instructions or data structures received over a network or data link can be buffered in RAM within a network interface module (e.g., a “NIC”), and then eventually transferred to computer system RAM and/or to less volatile computer storage media at a computer system. Thus, it should be understood that computer storage media can be included in computer system components that also (or even primarily) utilize transmission media.
- Computer-executable instructions comprise, for example, instructions and data which, when executed at one or more processors, cause a general-purpose computer system, special-purpose computer system, or special-purpose processing device to perform a certain function or group of functions. Computer-executable instructions may be, for example, binaries, intermediate format instructions such as assembly language, or even source code.
- Those skilled in the art will appreciate that the principles described herein may be practiced in network computing environments with many types of computer system configurations, including, personal computers, desktop computers, laptop computers, message processors, hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, mobile telephones, PDAs, tablets, pagers, routers, switches, and the like. The invention may also be practiced in distributed system environments where local and remote computer systems, which are linked (either by hardwired data links, wireless data links, or by a combination of hardwired and wireless data links) through a network, both perform tasks. As such, in a distributed system environment, a computer system may include a plurality of constituent computer systems. In a distributed system environment, program modules may be located in both local and remote memory storage devices.
- Those skilled in the art will also appreciate that the invention may be practiced in a cloud computing environment. Cloud computing environments may be distributed, although this is not required. When distributed, cloud computing environments may be distributed internationally within an organization and/or have components possessed across multiple organizations. In this description and the following claims, “cloud computing” is defined as a model for enabling on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services). The definition of “cloud computing” is not limited to any of the other numerous advantages that can be obtained from such a model when properly deployed.
- Still further, system architectures described herein can include a plurality of independent components that each contribute to the functionality of the system as a whole. This modularity allows for increased flexibility when approaching issues of platform scalability and, to this end, provides a variety of advantages. System complexity and growth can be managed more easily through the use of smaller-scale parts with limited functional scope. Platform fault tolerance is enhanced through the use of these loosely coupled modules. Individual components can be grown incrementally as business needs dictate. Modular development also translates to decreased time to market for new functionality. New functionality can be added or subtracted without impacting the core system.
-
FIG. 1 illustrates acomputer architecture 100 in which at least one embodiment may be employed.Computer architecture 100 includescomputer systems Computer systems communications modules 104A/104B are configured to communicate with other computer systems, typically via wired or wireless connections. The communications modules may, for example, receive inputs from users such asuser 105. Thecommunication modules 104A/104B may also transmit data to and from each other, to and from thevarious data sources 125, and to and from other computer systems not shown inFIG. 1 . - The data sources 125A-C may be any type of data stores including local data stores, distributed data stores, storage area networks (SANs) or other systems or means for storing data. Each data source 125
stores data elements 124A-C. The data elements may be any portion of data including individual data files or groups of files, database columns or rows or other data structures. The data sources may be local or remote tofirst computer system 101A and/orsecond computer system 101B. Both the first and second computer systems may have access to all or portions of thedata sources 125, depending on the current computer user's data access rights. - For instance,
user 105 may wish to perform a query against one or more of the data sources 125. The user may input search criteria 107 (on input 106) which are used by data query generatingmodule 110 to generate aquery 115. The generatedquery 115 includes the user-specifiedsearch criteria 107. The query is transmitted from thefirst computer system 101A to thesecond computer system 101B. The dataquery processing module 118 processes thedata query 115, accessing one or more of the data sources to satisfy the query. In some cases, the data query processing module may determine that additional data elements relevant to the search exist, but are not currently accessible to the user. For example, the additional data elements may be secured behind a data access wall that prevents unauthorized access. The user may, however, be able to access these data elements upon proper authentication. Thesecond computer system 101B may send back anyquery results 116 that were found within data elements to which the user has access, and may further include anindication 117 of those additional data elements that are relevant to the user-providedsearch criteria 107, but not currently accessible to the user. The indication may further indicate one or more steps the user is to take in order to access the additional data. - In this manner, embodiments described herein provide business users and other end-users with relevant information when they attempt to connect to a data source. For example, when a connection dialog box (or similar user experience) is displayed to a user (e.g. 105), the system described herein will display an option for the user to view
data source documentation 122, and may further display an option to initiate a defined business process (e.g. workflow entry point 123) to request access to thedata source 125. - A
metadata catalog 119 may be provided that stores metadata 121 aboutdata sources 125 and other objects. The metadata catalog may be a service, including a local service or a cloud-based service. The data maintained by the metadata catalog may include data source locations (e.g. data element links 120), business documentation 122 (along with other information about the data sources), and may include access workflow entry points 123. The access workflow entry points 123 may allow theuser 105 to begin a workflow that will allow them to access data that was previously inaccessible (e.g. by providing proper credentials). Embodiments herein also consolidate lineage (i.e. relationship) information for the various data artifacts and elements that are stored. With this information, the system can surface related and relevant information to the user. - For example, if the query does not have a specific request to access information, the system can look back in lineage information to infer the request access information. Moreover, the system implements containers such that it understands and can interpret an element that is hosted within another element, and can propagate metadata within containers and for related data sources in this manner. As such, if metadata has not been provided for a database, the system can look at the data element's container (the database server) and infer metadata from the container, or from other related data sources.
- In some embodiments,
metadata 121 provided by multiple items in the lineage or container chain may be blended together to provide the end-user a full 360 degree view of the information that is available and that they are interacting with. The system also supports multiple automatic algorithms to infer additional metadata (certification, column names, lineage, container relationships, descriptions, etc.) by looking at the structure of the data, the structure of metadata that surrounds the current set of metadata (e.g. similar columns in the same data source), metadata that has been provided earlier by the same author or other authors that are interacting with the system, or through other means. The system can automatically look at the lineage of data to understand whether a current data artifact should be certified, what categorization and classification should be applied and then apply and/or certify it. If requested, the system may provide information on how the attributes were gathered and display that information to the user. - Annotations and metadata for data sources can also be provided in social form. As at least some embodiments of the metadata catalog run as a service within the cloud (indeed, the metadata catalog may run locally, distributed or in a federated system), multiple people that interact with the service can provide annotations (social annotations/metadata). These are then applied or, in some cases, first approved by a user, a quorum of users (e.g. people that are endorsing or using the data source(s)), or by a central authority. The system can also learn from this and automatically approve annotations in the future. As such, the system can learn from all the input that is provided, and can automatically identify types of data (columns, tables, datasets) by observing how authors provide this information manually and then suggesting best matches in the future.
- A client application experience allows users to share data objects. The client application (which may run locally, distributed or on a federated system) may send data source metadata, including data storage locations and connection details, to the cloud service. The client application may also be used to provide and surface metadata. The client application further interacts with the
metadata catalog service 119 to provide the interface where the service would expose the supplemental data to the end-user. The client application can also be an application that sits in the cloud or on the machine of the user 105 (e.g. computer system 101A). - A stewardship application may also be provided (which may be different than, or part of the client application) that allows authorized users to modify or annotate data sources with additional metadata (e.g. data manager portal 301 of
FIG. 3 ). The authorized users may modify or annotate data sources by addingdocumentation 122 and the entry point (such as a URL or email address) for a dataaccess business workflow 123. Still further, a data consumption/analysis application experience may be provided that retrieves data source metadata from the cloud service and presents it to theuser 105 in the context of connecting to a secure data source. - As illustrated in
FIG. 3 , the list of known data sources 304 may be provided in the data manager portal 301, and exposed to authorized data managers. As used herein, the term “authorized data managers” may refer to administrators, stewards, data experts or other users that are authorized to makechanges 302 to themetadata catalog 303. Using the data manager portal, company data managers, curators or owners may providemetadata 121 fordata sources 125, supplementing any existing metadata (such as location and type) with business- or user-friendly names and descriptions, and further provide anentry point 123 for a business process to request access to the data source. This entry point may be an email address, a web page URL, or any URI-addressable endpoint. This manager-provided metadata is stored in themetadata catalog 119/503. - Subsequently, when users use the client application to connect to a data source (e.g. 125A), the connection experience will display the friendly name and business description for the data source, along with a link to request access to the data. Users who do not have the requisite permissions to access to the data source can select this link to initiate the request access workflow. The connection experience (both the discovery of data sources and the display of metadata and access link) can be presented in any client application that communicates with the metadata catalog, including standalone client applications and clients that are built in to other applications or services.
- In one embodiment, a computer system performs a method for providing access to supplemental query data that is relevant to a data query. The computer system (e.g. 101A) receives an input indicating that a data query is to be processed, the input providing one or more search criteria. For example, the
communications module 104A ofcomputer system 101A may receiveinput 106 indicating that a query is to be processed. The input may provide various different search terms andother search criteria 107 for the search. The data query generatingmodule 110 may then generate adata query 115 according to the search criteria in the received input. Thecommunications module 104A may then send the data query to thesecond computer system 101B which is configured to process the data query. Thesecond computer system 101B includes a dataquery processing module 118 which processes thequery 115 according to thesearch criteria 107 specified by theuser 105. - After the query has been processed, and the relevant data corresponding to the query has been retrieved, the
first computer system 101A receives theresults 116 of the data query. The results indicate that one or more additional data elements relevant to the search criteria have been identified. For example, the data elements may be stored in a data store to which the user does not currently have access. As the query may be processed using data elements frommultiple data sources 125, one or more of these data sources may be behind a paywall or behind some other sort of wall that prevents unauthorized access. These additional data elements (or an indication thereof) may be displayed to theuser 105. The display may thus include an indication of available data elements, and further include various steps that are to be taken in order to access the identified additional data elements. - After viewing the steps that are to be taken to access one or more of the
additional data elements 117, the user may send adata access input 108 to thefirst computer system 101A. Thedata access input 108 may correspond to at least one of the steps provided in the indication of additional data elements. For example, if one of the data access steps says to provide a user name and password, or to click a button to initiate a data access workflow, or to provide an email address or other identifier, or to perform some other step, the user may do so in order to access the additional data elements. The user's input may be received at auser interface 111 at thefirst computer system 101A, or via some other means. Once the input is received, thecomputer system 101A may perform the steps indicated by the user'sdata access input 108. Once the process has successfully completed, and the user has been granted access to the data store(s) storing the additional data elements, the data elements may be accessed (either directly bycomputer system 101A, or in some cases, thesecond computer system 101B may access the data elements and send them to thefirst computer system 101A) and provided to the user. In some embodiments, theadditional data elements 117 may be displayed inUI 111 next to the original query results 116. - As such, the
UI 111 may be configured to display the data access steps 112 needed to access the additional data, and may further display the additional data once it is received alongside the existing query results 116. The UI can display a prompt for authentication credentials or other identifiers that would allow the user access to the additional data elements. If subsequent authentication, workflow or other steps are needed, those may also be displayed in theUI 111. The UI may be generated and/or updated dynamically on-demand. Thus, if a user has entered a certain query, and has received certain results, the UI may be dynamically generated to show those results. The dynamically generated UI may further generate prompts, buttons or other means for allowing the user to provide inputs in order to access anyadditional data elements 117 indicated in the query results 116. - The
UI 111 may further display a data source name for any or all of theadditional data elements 117. The source name may be a full name and path, or may be a simplified, user-friendly name that is easier to remember and understand. The UI may also display an option to initiate adata access workflow 123 to access resources at the named data source. The data access workflow may lead theuser 105 through a series of steps that, when completed, provide access to the data elements. In cases where the series of steps includes authentication or other forms of access control, the authentication or access control capabilities may be provided by a third party by another module oncomputer system 101A and/or 101B. In some cases, a data manager may be notified upon determining that a particular data access workflow has been initiated (e.g. when the data manager is the owner of the data). As such, the owner of the data may be apprised of how and when users are accessing the owner's data. - Still further, in some cases, the data access workflow may access a
metadata catalog 119 to determine how to connect to specifieddata source 125. As mentioned above, themetadata catalog 119 is configured to store metadata that maps data elements to data sources. The metadata catalog thereby allows any client application that has access to the metadata catalog, access to themetadata 121 and the storeddata elements 124A-C. The metadata catalog will be explained in greater detail below. - In another embodiment, a computer system performs a method for generating and displaying a user interface that allows access to supplemental query data that is relevant to a search query. The method includes an act of receiving data query results in response to a data query that includes one or more search criteria, the data query results indicating that one or more additional data elements relevant to the search criteria have been identified, the additional data elements being stored at one or more specified data sources. For example,
communications module 104A may receive data query results 116 from thecommunications module 104B ofcomputer system 101B. The data query results include those data elements that corresponded to the user'ssearch criteria 107, and were accessible to the user. The data queryresults 116 may also include an indication thatadditional data elements 117 are available and are pertinent to the user'ssearch criteria 107. Thefirst computer system 101A may generate a UI that shows an indication of steps 112 that are to be taken in order to access the identified additional data elements. - The
user 105 may then provide aninput 108 which may be received at theUI 111. The input corresponds to at least one of the data access steps 112 that are to be taken to access the identified additional data elements. Upon determining that the input received at theUI 111 allows access to the specified data sources (e.g. 125A-C), thecomputer system 101A may send the receiveddata access input 108 to the specified data sources to retrieve the identified additional data elements. Thus, for example, a user may provide subscription information to access data that is only available to subscribers (e.g. movies or television shows). This subscription data, along with any other additional data elements that are accessible to the user, may be sent to the user (either directly from thedata sources 125, or through thesecond computer system 101B) and received at thefirst computer system 101A. The additional data elements (in this example, the subscription data) may be displayed inUI 111. - In some embodiments, the
UI 111 may include a generic interface to request data from multiple different data sources. These data sources may be multiple different data source types. For instance, a data source may be a website, a cloud data provider, a local data store or other type of data source. The data queryresults 116 may include data from any number and/or type of data sources. In some cases, users may share links to data sources, or directly to certain data elements within the data sources. Aclient application 113 may be provided which allows users to sharelinks 120 to data objects stored in the data sources 125. Additionally or alternatively, the client application may allow users to share data objects directly with other users. The client application can look atmetadata 121 related to the data sources and data elements, and make determinations as to how best to access the data. The client application itself 113 may be a standalone application, or may be built-in as part of another application. The client application may monitor user inputs, and may collect data and use heuristics to update internal processes based on the accessed information. For example, if the client application determines that a certain user input is used to request certain data, or that certain requests lead to additional prompts, the client application may learn from these past scenarios, and may prompt the user for certain inputs accordingly. - The concepts described above will be explained further below with regard to
method 200 ofFIG. 2 . In view of the systems and architectures described above, methodologies that may be implemented in accordance with the disclosed subject matter will be better appreciated with reference to the flow chart ofFIG. 2 . For purposes of simplicity of explanation, the methodologies are shown and described as a series of blocks. However, it should be understood and appreciated that the claimed subject matter is not limited by the order of the blocks, as some blocks may occur in different orders and/or concurrently with other blocks from what is depicted and described herein. Moreover, not all illustrated blocks may be required to implement the methodologies described hereinafter. - Turning now to
FIG. 2 , a flowchart of amethod 200 is illustrated for generating and displaying a user interface that allows data managers to provide supplemental metadata for data sources. Themethod 200 will now be described with frequent reference to the components and data ofenvironment 100. -
Method 200 includes an act of generating a user interface (UI) that shows an indication of one or more data elements to which a metadata catalog is linked, the metadata catalog including references to one or more different data sources that store data elements (act 210). TheUI 111 may thus show data elements that are included in themetadata catalog 119. Themetadata catalog 119 may link todata sources 125, as well as individual data elements (e.g. via link 120). Theuser 105 may submit aninput 109 atUI 111 which indicates that metadata for at least one of thedata sources 125 is to be modified (act 220). The modification includes at least one of the following: providingadditional metadata 121 for a specified data source, providingdocumentation 122 for a specified data source and providing an entry point for anaccess request workflow 123. - For example, a data manager may wish to provide metadata for a given data source. This metadata may be shown to users that are requesting data from or are querying that data source. For instance, when a user is requesting data from
data source 125B, and the data manager has providedmetadata 121 for that data source (perhaps indicating information about the data source such as its name, its general content, and methods of accessing the data), that metadata will be displayed in the UI along with the query results 116. Similarly, if a data manager addsdocumentation 122 regarding a data source, adds a data accessworkflow entry point 123, or performs some other form of metadata modification, the results of the modification may be shown in theUI 111 along with the data query results 116. - Still further, it should be noted that data managers or other users may wish to provide metadata for views, queries or other objects which may be annotated with additional metadata. For example, a data manager may provide an input that provides additional metadata for a particular view or query. Additionally or alternatively, the data manager's input may provide documentation about the view or query, or may provide an entry point for an access request workflow. Indeed, substantially any metadata modifications that can be applied to data sources can similarly be applied to data queries, views or other objects.
-
Method 200 further includes an act of performing the received data source metadata modification according to the received input (act 230) and, upon determining that the metadata has been modified, the updatedmetadata 114 for the data sources is displayed in the UI (act 240). Thus, the updatedmetadata 121, documentation,workflow entry point 123 or other metadata modification is shown to the user in the dynamically generatedUI 111. The UI may further allowuser 105 to modify data elements stored on the data sources. The user may make changes to UI elements, and have those changes propagated back to the data stores on which the data elements are stored. - Still further, the UI may allow users to provide feedback regarding the
metadata 121 provided by the data manager. The feedback provided by the user may be reviewable by the data manager. As such, if the user suggested changing a column name, for example, and the data manager noticed that multiple users had requested changing that column name, the data manager may make the appropriate changes. As such, a user (or group of users) may be able to effect changes in themetadata 121 associated with the data sources. In some cases, an user other than a data manager can verify the changes, such as a verified user, or a quorum of users that have approved a change. The metadata may indicate a variety of different things, including indicating, for example, that a data source is an authoritative source (i.e. the master source for certain data elements) among multiple sources. In some cases, each user may be assigned its own metadata repository, and may be able to update and/or control themetadata 121 associated with the data sources. In such cases, usage of the user's metadata repository may be monitored, and then used to improve the original metadata repository. In this manner, the metadata catalog can learn and improve over time, both from the users' feedback and from monitoring the actual usage of the catalog. - In some cases, the updated metadata provided by the user 105 (i.e. metadata modification input 109) may indicate how end users are to access the data source. For example, as shown in
FIG. 3 , a user may use a metadata manager portal 301 to edit or annotatedata sources 302. These annotations are applied within themetadata catalog 303 and are provided when enumerating data sources 304 available to theuser 105 and/or to other metadata managers. This updated metadata may provide steps that the users are to perform to access the (currently inaccessible) data. - The steps to be taken may be displayed along with any data query results received from
computer system 101B and/or upon the occurrence of a specified event. For instance, the steps to be taken may be displayed next to, above, below or near the returned query results. Additionally or alternatively, the steps to be taken may be accessed upon the occurrence of an event such as a mouse click, mouseover, touch input or some other event. The steps to be taken may be provided by thefirst computing system 101A, by thesecond computing system 101B, by another computing system (not shown inFIG. 1 ) or by a third party. As such, currently inaccessible data may be accessed by the user after the user performs the necessary steps. In this manner, owners or managers of data sources may provide those steps that are necessary to access the data owned or managed by them. Owners can thus make their data more readily available to those who are authorized to access the data, but still need to take additional steps to access it. - Accordingly, methods, systems and computer program products are provided which provide access to supplemental query data that is relevant to a data query, and further generate and display user interfaces that allow access to supplemental query data that is relevant to a search query. Moreover, methods, systems and computer program products are provided which generate and display user interfaces that allow data managers to provide supplemental metadata for data sources.
- The concepts and features described herein may be embodied in other specific forms without departing from their spirit or descriptive characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the disclosure is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.
Claims (20)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/082,481 US20150074565A1 (en) | 2013-09-09 | 2013-11-18 | Interfaces for providing enhanced connection data for shared resources |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201361875582P | 2013-09-09 | 2013-09-09 | |
US14/082,481 US20150074565A1 (en) | 2013-09-09 | 2013-11-18 | Interfaces for providing enhanced connection data for shared resources |
Publications (1)
Publication Number | Publication Date |
---|---|
US20150074565A1 true US20150074565A1 (en) | 2015-03-12 |
Family
ID=52626569
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/082,481 Abandoned US20150074565A1 (en) | 2013-09-09 | 2013-11-18 | Interfaces for providing enhanced connection data for shared resources |
US14/082,401 Active 2034-07-31 US9594812B2 (en) | 2013-09-09 | 2013-11-18 | Interfaces for accessing and managing enhanced connection data for shared resources |
US14/082,451 Expired - Fee Related US9547699B2 (en) | 2013-09-09 | 2013-11-18 | Providing enhanced connection data for shared resources |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/082,401 Active 2034-07-31 US9594812B2 (en) | 2013-09-09 | 2013-11-18 | Interfaces for accessing and managing enhanced connection data for shared resources |
US14/082,451 Expired - Fee Related US9547699B2 (en) | 2013-09-09 | 2013-11-18 | Providing enhanced connection data for shared resources |
Country Status (1)
Country | Link |
---|---|
US (3) | US20150074565A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10152512B2 (en) * | 2016-12-28 | 2018-12-11 | Accenture Global Solutions Limited | Metadata-driven program code generation for clinical data analysis |
GB2574903A (en) * | 2018-06-18 | 2019-12-25 | Arm Ip Ltd | Compilable data model |
CN111526198A (en) * | 2020-04-24 | 2020-08-11 | 深圳融安网络科技有限公司 | Data synchronization method and device for server and computer readable storage medium |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102774097B1 (en) * | 2019-03-22 | 2025-03-04 | 삼성전자주식회사 | Electronic device and control method thereof |
US12032696B2 (en) * | 2020-09-30 | 2024-07-09 | EMC IP Holding Company LLC | Confidence-enabled data storage systems |
US11973653B2 (en) | 2020-11-11 | 2024-04-30 | Mapped Inc. | Automatic discovery of automated digital systems through link salience |
US11709813B2 (en) * | 2021-06-10 | 2023-07-25 | Jpmorgan Chase Bank, N.A. | System and method for implementing a contract data management module |
WO2023215892A1 (en) | 2022-05-06 | 2023-11-09 | Mapped Inc. | Ensemble learning for extracting semantics of data in building systems |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20010047279A1 (en) * | 2000-04-13 | 2001-11-29 | Gargone Peter Sebastian | Automating high-level business functions in a generic manner |
US6438580B1 (en) * | 1998-03-30 | 2002-08-20 | Electronic Data Systems Corporation | System and method for an interactive knowledgebase |
US20030050976A1 (en) * | 1999-12-10 | 2003-03-13 | Myteam.Com | Structure for accessing and populating community websites |
US20030074277A1 (en) * | 2001-10-16 | 2003-04-17 | Foutz Gregory L. | Method and apparatus for automatically reviewing application information and preparing responsive communications |
US6901394B2 (en) * | 2000-06-30 | 2005-05-31 | Askme Corporation | Method and system for enhanced knowledge management |
US20090077043A1 (en) * | 2003-01-17 | 2009-03-19 | International Business Machines Corporation | System and method for accessing non-compatible content repositories |
US20100161382A1 (en) * | 2008-10-27 | 2010-06-24 | Survcast, Inc. | Content management |
US20110302290A1 (en) * | 2010-06-07 | 2011-12-08 | Novell, Inc. | System and method for managing changes in a network datacenter |
US20120185779A1 (en) * | 2011-01-13 | 2012-07-19 | International Business Machines Corporation | Computer System and Method of Audience-Suggested Content Creation in Social Media |
US8856165B1 (en) * | 2010-03-26 | 2014-10-07 | Google Inc. | Ranking of users who report abuse |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8650152B2 (en) * | 2004-05-28 | 2014-02-11 | International Business Machines Corporation | Method and system for managing execution of data driven workflows |
US7584177B2 (en) * | 2005-06-29 | 2009-09-01 | Google Inc. | Determination of a desired repository |
US7770220B2 (en) * | 2005-08-16 | 2010-08-03 | Xerox Corp | System and method for securing documents using an attached electronic data storage device |
US20070043735A1 (en) | 2005-08-19 | 2007-02-22 | Bodin William K | Aggregating data of disparate data types from disparate data sources |
US20070162417A1 (en) * | 2006-01-10 | 2007-07-12 | Kabushiki Kaisha Toshiba | System and method for selective access to restricted electronic documents |
US20070233685A1 (en) * | 2006-03-31 | 2007-10-04 | Microsoft Corporation | Displaying access rights on search results pages |
US20080040388A1 (en) | 2006-08-04 | 2008-02-14 | Jonah Petri | Methods and systems for tracking document lineage |
US8751507B2 (en) | 2007-06-29 | 2014-06-10 | Amazon Technologies, Inc. | Recommendation system with multiple integrated recommenders |
CA2702651C (en) * | 2007-10-15 | 2019-07-02 | Lexisnexis Group | System and method for searching for documents |
US9020913B2 (en) * | 2007-10-25 | 2015-04-28 | International Business Machines Corporation | Real-time interactive authorization for enterprise search |
US20100057686A1 (en) * | 2008-09-03 | 2010-03-04 | Oded Haim Breiner | Deep Web Search |
US8200661B1 (en) * | 2008-12-18 | 2012-06-12 | Google Inc. | Dynamic recommendations based on user actions |
US8171337B2 (en) | 2009-03-30 | 2012-05-01 | The Boeing Company | Computer architectures using shared storage |
US8447760B1 (en) | 2009-07-20 | 2013-05-21 | Google Inc. | Generating a related set of documents for an initial set of documents |
US20110231317A1 (en) | 2010-03-19 | 2011-09-22 | Wihem Arsac | Security sensitive data flow analysis |
US20110314071A1 (en) | 2010-06-17 | 2011-12-22 | Openwave Systems Inc. | Metadata-based data access and control |
US8955037B2 (en) | 2011-05-11 | 2015-02-10 | Oracle International Corporation | Access management architecture |
US8521655B2 (en) | 2011-06-06 | 2013-08-27 | Bizequity Llc | Engine, system and method for providing cloud-based business intelligence |
US20140164080A1 (en) * | 2011-06-07 | 2014-06-12 | Solomon Thompson, JR. | Organizational tools and or a collaboration system utilizing the same therein |
US8713076B2 (en) | 2012-01-20 | 2014-04-29 | Cross Commerce Media, Inc. | Providing a multi-tenant knowledge network |
US8478693B1 (en) | 2012-02-13 | 2013-07-02 | Google Inc. | Framework for specifying access to protected content |
US9367646B2 (en) * | 2013-03-14 | 2016-06-14 | Appsense Limited | Document and user metadata storage |
US9043942B1 (en) * | 2013-06-19 | 2015-05-26 | Inkling Systems, Inc. | Methods and systems for facilitating limited access to paid-access or registration-required content items via search engine results, direct web addressing or indirect linking |
-
2013
- 2013-11-18 US US14/082,481 patent/US20150074565A1/en not_active Abandoned
- 2013-11-18 US US14/082,401 patent/US9594812B2/en active Active
- 2013-11-18 US US14/082,451 patent/US9547699B2/en not_active Expired - Fee Related
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6438580B1 (en) * | 1998-03-30 | 2002-08-20 | Electronic Data Systems Corporation | System and method for an interactive knowledgebase |
US20030050976A1 (en) * | 1999-12-10 | 2003-03-13 | Myteam.Com | Structure for accessing and populating community websites |
US20010047279A1 (en) * | 2000-04-13 | 2001-11-29 | Gargone Peter Sebastian | Automating high-level business functions in a generic manner |
US6901394B2 (en) * | 2000-06-30 | 2005-05-31 | Askme Corporation | Method and system for enhanced knowledge management |
US20030074277A1 (en) * | 2001-10-16 | 2003-04-17 | Foutz Gregory L. | Method and apparatus for automatically reviewing application information and preparing responsive communications |
US20090077043A1 (en) * | 2003-01-17 | 2009-03-19 | International Business Machines Corporation | System and method for accessing non-compatible content repositories |
US20100161382A1 (en) * | 2008-10-27 | 2010-06-24 | Survcast, Inc. | Content management |
US8856165B1 (en) * | 2010-03-26 | 2014-10-07 | Google Inc. | Ranking of users who report abuse |
US20110302290A1 (en) * | 2010-06-07 | 2011-12-08 | Novell, Inc. | System and method for managing changes in a network datacenter |
US20120185779A1 (en) * | 2011-01-13 | 2012-07-19 | International Business Machines Corporation | Computer System and Method of Audience-Suggested Content Creation in Social Media |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10152512B2 (en) * | 2016-12-28 | 2018-12-11 | Accenture Global Solutions Limited | Metadata-driven program code generation for clinical data analysis |
GB2574903A (en) * | 2018-06-18 | 2019-12-25 | Arm Ip Ltd | Compilable data model |
CN111526198A (en) * | 2020-04-24 | 2020-08-11 | 深圳融安网络科技有限公司 | Data synchronization method and device for server and computer readable storage medium |
Also Published As
Publication number | Publication date |
---|---|
US9547699B2 (en) | 2017-01-17 |
US20150074096A1 (en) | 2015-03-12 |
US20150074078A1 (en) | 2015-03-12 |
US9594812B2 (en) | 2017-03-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9594812B2 (en) | Interfaces for accessing and managing enhanced connection data for shared resources | |
US11575772B2 (en) | Systems and methods for initiating processing actions utilizing automatically generated data of a group-based communication system | |
US8819068B1 (en) | Automating creation or modification of database objects | |
US9158799B2 (en) | Storing and retrieving context sensitive data in a management system | |
Mwilu et al. | Design science research contribution to business intelligence in the cloud—A systematic literature review | |
US9275121B2 (en) | Interoperable shared query based on heterogeneous data sources | |
US20140330821A1 (en) | Recommending context based actions for data visualizations | |
US20170078429A1 (en) | Cross domain in-browser proxy | |
US20170206684A1 (en) | Intelligent container for analytic visualizations | |
US20110276583A1 (en) | Automatic role determination for search configuration | |
US11314611B2 (en) | Universal identity management or governance over applications | |
US10768794B2 (en) | Truncated synchronization of data object instances | |
US10719555B2 (en) | System and method in a database system for sharing a data item with an entity in another tenant domain | |
US12242434B2 (en) | Reducing reliance on content management system resources in a content management system | |
Takabi et al. | Semantic-based policy management for cloud computing environments | |
US11475064B2 (en) | System and method in a database system for creating a field service work order | |
US8489561B1 (en) | Learning enterprise portal content meta-model | |
US10210228B2 (en) | Federated management of a plurality of metadata storage mechanisms | |
US9654522B2 (en) | Methods and apparatus for an integrated incubation environment | |
US20150100888A1 (en) | Providing a common interface for accessing and presenting component configuration settings | |
CN116662437A (en) | Method, apparatus, device and medium for content management | |
CN114218590A (en) | Authority configuration method, device, equipment and medium | |
US20250045286A1 (en) | Systems and method for inferred graph data link attestation | |
US20250013664A1 (en) | Accordian spokes system | |
US11075916B2 (en) | Entitlement-driven communication of functionality privileges between network-based services |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: MICROSOFT CORPORATION, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ROCHE, MATTHEW;LIENSBERGER, CHRISTIAN;KASPERSKI, ZIV;AND OTHERS;SIGNING DATES FROM 20131112 TO 20131113;REEL/FRAME:031620/0795 |
|
AS | Assignment |
Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:034747/0417 Effective date: 20141014 Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:039025/0454 Effective date: 20141014 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |