Multi-source data synchronous updating system based on space-time digital base
Technical Field
The invention relates to the technical field of data synchronous updating, in particular to a multi-source data synchronous updating system based on a space-time digital base.
Background
Along with the continuous advancement of big data technology and informatization process, more and more application scenes relate to the collection, processing and management of multi-source heterogeneous data, and the data sources possibly comprise a sensor network, internet of things equipment, satellite data, video monitoring and the like, the data types comprise structured data, semi-structured data and unstructured data, so that in order to better utilize the massive and diversified data, space-time information becomes a key dimension for realizing data integration and analysis, and particularly in the complex application fields of smart cities, intelligent transportation, industrial Internet and the like, unified alignment and efficient analysis of the multi-source data can be realized through space-time management, however, how to realize synchronous update and intelligent analysis of the data in the space-time dimension is still an important challenge faced by the current technology.
In the prior art, a decentralized or independent processing mode is generally adopted for managing and updating multi-source data, so that the real-time performance and consistency of the data are poor, in many systems, the space-time marking of the data is not accurate enough, the effective alignment cannot be realized in time and space dimensions, further, the subsequent analysis and decision process is influenced, in addition, because the updating frequency and time sequence relation of each data source are different, the system often lacks an intelligent synchronization mechanism, the updating lag or the data redundancy is caused, and meanwhile, when the large-scale heterogeneous data are processed in the prior art, the prior art generally depends on a fixed updating strategy, lacks dynamic adjustment capability, is difficult to cope with complex application scenes with frequent data changes, and the problems limit the application value of the multi-source data in intelligent analysis.
Aiming at the defects in the prior art, the invention provides a multi-source data synchronous updating system based on a space-time digital base, which can analyze and early warn updated multi-source data in real time, provides stronger support for decision making in complex scenes and improves the efficiency and expansibility of the system.
Disclosure of Invention
The invention provides a multi-source data synchronous updating system based on a space-time digital base.
The multi-source data synchronous updating system based on the space-time digital base comprises a data acquisition module, a space-time data management module, a data preprocessing module, a data synchronous updating module and an intelligent analysis module, wherein the data acquisition module is used for acquiring data of a user;
the data acquisition module acquires different types of multi-source data in real time from a plurality of data sources, wherein the data sources comprise a sensor network, satellite data, internet of things equipment and video monitoring, and the multi-source data comprise structured data, semi-structured data and unstructured data;
The space-time data management module performs space-time marking and management on the collected multi-source data, establishes a unified space-time index system and performs space-time management based on a space-time digital base;
The data preprocessing module is used for preprocessing the acquired multi-source data, and comprises formatting, denoising and deduplication;
The data synchronous updating module realizes synchronous updating of multi-source data based on the space-time digital base, and dynamically calculates updating frequency and time sequence relation of various data sources by adopting an intelligent algorithm;
the intelligent analysis module performs intelligent analysis on the synchronously updated multi-source data, provides real-time feedback and early warning, generates decision suggestions or executes corresponding adjustment operation.
Optionally, the spatiotemporal data management module includes:
constructing a space-time index, namely constructing a multidimensional space-time index through a Geographic Information System (GIS) technology, and supporting the positioning of different data types in time and space;
the space-time data association is used for associating data with different time and different space dimensions, supporting the tracing of historical data and the dynamic generation of space-time paths, and realizing the semantic association and logical relation construction of the space-time data;
And the space-time data storage is used for storing indexed and associated space-time data, supporting access operation and having distributed storage capacity.
Optionally, the distributed storage includes:
the distributed data acquisition nodes are distributed at different geographic positions and used for locally acquiring data and uploading the data to the central data processing center, and each node has the functions of data preprocessing and temporary storage;
Distributed computing nodes, namely, processing large-scale data in a distributed mode on different computing nodes, reasonably distributing computing resources through a task scheduling mechanism, and supporting parallel processing of space-time data;
And the distributed storage nodes are used for providing distributed storage service for the space-time data and supporting an elastic expansion and redundancy backup mechanism.
Optionally, the data synchronization update module includes:
the priority evaluation of space-time data, namely, the data of each data source can be subjected to priority evaluation according to the real-time property, the space-time dependency and the importance of the data source;
Updating frequency calculation, namely dynamically adjusting the updating frequency based on the priority of the data and the current system resource condition;
Timing relationship calculation the timing relationship of data synchronization is managed by a timing diagram, the update timing of each data source being dependent on the updates of the other related data sources.
Optionally, the calculation formula of the spatio-temporal data priority is:
;
Wherein, Is the data sourceIs set according to the priority of (1),Is the data sourceIs used to update the historical update frequency of (a),Is the data sourceIs used to determine the spatial-temporal dependent weights of (1),Is the data sourceIs added to the importance of the weight of (a),Is a weight coefficient.
Optionally, the update frequency calculation is expressed as:
;
Wherein, Is the data sourceIs updated (number of updates per unit time),Is the data sourceIs used for the time interval of (a),For the current utilization of the system resources,AndIs a weight coefficient.
Optionally, the timing relationship is represented by an adjacency matrix, expressed as:
;
The timing update is expressed as: ;
Wherein, Is the data sourceIs used for the next time of updating the data,For all dependent data sourcesIs used for the update time of the (c),Is the data sourceIs a time delay of (a) to (b).
Optionally, the intelligent analysis module includes:
The data early warning is based on the synchronized multi-source data, and the functions of space-time dynamic monitoring and intelligent early warning are supported;
data clustering and classifying, namely classifying data from different data sources according to preset rules;
And (3) data visualization, namely performing visual display on the classification result in the forms of graphs, charts and the like, and supporting the generation of interactive analysis views.
The invention has the beneficial effects that:
According to the invention, data acquisition, management, preprocessing, synchronous updating and intelligent analysis from a plurality of heterogeneous data sources are realized through the multi-module collaborative work, the system realizes the accurate marking and management of the space-time dimension of multi-source data through the space-time data management module, the consistency of the data of different data sources in time and space is ensured, the alignment precision of the data is greatly improved, secondly, the data preprocessing module carries out formatting, denoising and deduplication processing on the acquired data, the quality and the integrity of the data are ensured, the data synchronous updating module can intelligently adjust the updating frequency and the time sequence relation of the data according to the space-time priority and the system resource condition, the real-time performance and the high efficiency of the data synchronization are ensured, and in addition, the intelligent analysis module carries out analysis on the synchronized data to generate a visual analysis result, provides real-time early warning and decision advice, and the application value and the response capability of the system are greatly improved.
The system not only improves the timeliness and the space consistency of data management, but also provides deep insight into multi-source data through intelligent analysis, supports synchronous updating and dynamic analysis of large-scale heterogeneous data, and particularly can remarkably improve the operation efficiency and the data processing capacity of the system in complex scenes with large data quantity, various data types and frequent change.
Drawings
In order to more clearly illustrate the invention or the technical solutions of the prior art, the drawings which are used in the description of the embodiments or the prior art will be briefly described, it being obvious that the drawings in the description below are only of the invention and that other drawings can be obtained from them without inventive effort for a person skilled in the art.
Fig. 1 is a schematic diagram of a system functional module according to an embodiment of the present invention.
Detailed Description
The invention will now be described in detail with reference to the drawings and to specific embodiments. While the invention has been described herein in detail in order to make the embodiments more detailed, the following embodiments are preferred and can be embodied in other forms as well known to those skilled in the art, and the accompanying drawings are only for the purpose of describing the embodiments more specifically and are not intended to limit the invention to the specific forms disclosed herein.
It should be noted that references in the specification to "one embodiment," "an example embodiment," "some embodiments," etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the relevant art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
Generally, the terminology may be understood, at least in part, from the use of context. For example, the term "one or more" as used herein may be used to describe any feature, structure, or characteristic in a singular sense, or may be used to describe a combination of features, structures, or characteristics in a plural sense, depending at least in part on the context. In addition, the term "based on" may be understood as not necessarily intended to convey an exclusive set of factors, but may instead, depending at least in part on the context, allow for other factors that are not necessarily explicitly described.
As shown in fig. 1, a multi-source data synchronous updating system based on a space-time digital base, the system comprises a data acquisition module, a space-time data management module, a data preprocessing module, a data synchronous updating module and an intelligent analysis module, wherein the data acquisition module is used for acquiring data of a user;
the data acquisition module acquires different types of multi-source data in real time from a plurality of data sources, wherein the data sources comprise a sensor network, satellite data, internet of things equipment and video monitoring, and the multi-source data comprises structured data, semi-structured data and unstructured data;
the space-time data management module performs space-time marking and management on the collected multi-source data, establishes a unified space-time index system, enables heterogeneous data of different sources to be accurately aligned in space-time dimension, performs space-time management based on a space-time digital base, and ensures consistency of the data in two dimensions of time and space;
The data preprocessing module performs preprocessing on the acquired multi-source data, including formatting, denoising and de-duplication, so as to ensure the data quality;
The data synchronous updating module realizes synchronous updating of multi-source data based on the space-time digital base, and dynamically calculates updating frequency and time sequence relation of various data sources by adopting an intelligent algorithm;
the intelligent analysis module performs intelligent analysis on the synchronously updated multi-source data, provides real-time feedback and early warning, generates decision suggestions or executes corresponding adjustment operation.
The space-time data management module comprises:
Constructing a multi-dimensional space-time index through a Geographic Information System (GIS) technology, supporting the positioning of different data types in time and space, and being capable of quickly searching a data set in any space-time segment;
the space-time data association is used for associating data with different time and different space dimensions, supporting the tracing of historical data and the dynamic generation of space-time paths, and realizing the semantic association and logical relation construction of the space-time data;
And the space-time data storage is used for storing indexed and associated space-time data, supports access operation, has distributed storage capacity, and ensures the expansibility of large-scale data processing and the storage safety.
The distributed storage includes:
the distributed data acquisition nodes are distributed at different geographic positions and used for locally acquiring data and uploading the data to the central data processing center, and each node has the functions of data preprocessing and temporary storage;
Distributed computing nodes, namely, processing large-scale data in a distributed mode on different computing nodes, reasonably distributing computing resources through a task scheduling mechanism, and supporting parallel processing of space-time data;
And the distributed storage nodes provide distributed storage service for the space-time data, support elastic expansion and redundancy backup mechanisms and ensure the safety and high availability of data storage.
The data synchronization updating module comprises:
the priority evaluation of space-time data, namely, the data of each data source can be subjected to priority evaluation according to the real-time property, the space-time dependency and the importance of the data source;
Updating frequency calculation, namely dynamically adjusting the updating frequency based on the priority of the data and the current system resource condition;
Timing relationship calculation the timing relationship of data synchronization is managed by a timing diagram, the update timing of each data source being dependent on the updates of the other related data sources.
The calculation formula of the space-time data priority is as follows:
;
Wherein, Is the data sourceIs set according to the priority of (1),Is the data sourceIs indicative of how frequently it changes,Is the data sourceRepresenting the temporal and spatial dependence of the data with other data sources,Is the data sourceThe importance weight of the data is set according to the actual application scene, the importance degree of the data is represented,Is a weight coefficient.
The update frequency calculation is expressed as:
;
Wherein, Is the data sourceIs updated (number of updates per unit time),Is the data sourceRepresenting the data sourceIs a time interval from the present time interval,For the current system resource utilization rate, the range is thatAnd, representing the load condition of the system,AndThe weighting coefficient represents the influence degree of the priority and the utilization rate of the system resources on the update frequency.
The timing relationship is represented by an adjacency matrix, expressed as:
;
The timing update is expressed as: ;
Wherein, Is the data sourceIs used for the next time of updating the data,For all dependent data sourcesIs used for the update time of the (c),Is the data sourceRepresents the timing skew allowed during the data synchronization.
The intelligent analysis module comprises:
Based on the synchronized multi-source data, the data early warning system supports space-time dynamic monitoring and intelligent early warning functions and helps a user to identify possible abnormal or emergency situations in advance;
Data clustering and classification, namely classifying data from different data sources according to preset rules, so that the structured management and quick query of large-scale data are facilitated;
and data visualization, namely visually displaying classification results in the forms of graphs, charts and the like, supporting the generation of interactive analysis views, and helping users to intuitively know the space-time distribution and change of data.
The invention is intended to cover any alternatives, modifications, equivalents, and variations that fall within the spirit and scope of the invention. In the following description of preferred embodiments of the invention, specific details are set forth in order to provide a thorough understanding of the invention, and the invention will be fully understood to those skilled in the art without such details. In other instances, well-known methods, procedures, flows, components, circuits, and the like have not been described in detail so as not to unnecessarily obscure aspects of the present invention.
The foregoing is merely a preferred embodiment of the present invention and it should be noted that modifications and adaptations to those skilled in the art may be made without departing from the principles of the present invention, which are intended to be comprehended within the scope of the present invention.