[go: up one dir, main page]

CN115994313A - Crowd movement modeling method and device based on visiting location clustering - Google Patents

Crowd movement modeling method and device based on visiting location clustering Download PDF

Info

Publication number
CN115994313A
CN115994313A CN202310281628.2A CN202310281628A CN115994313A CN 115994313 A CN115994313 A CN 115994313A CN 202310281628 A CN202310281628 A CN 202310281628A CN 115994313 A CN115994313 A CN 115994313A
Authority
CN
China
Prior art keywords
individual
access
visit
exploration
individuals
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310281628.2A
Other languages
Chinese (zh)
Other versions
CN115994313B (en
Inventor
李楠
张馨元
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN202310281628.2A priority Critical patent/CN115994313B/en
Publication of CN115994313A publication Critical patent/CN115994313A/en
Application granted granted Critical
Publication of CN115994313B publication Critical patent/CN115994313B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本申请公开了一种基于访问地点聚类的人群移动建模方法及装置,属于轨迹预测领域,方法包括:基于历史人群移动轨迹数据获取人群中多个个体的访问地点和访问地点聚类;根据访问地点、访问地点聚类和历史人群移动轨迹提取人群移动特征,并根据人群移动特征生成人群移动模型的模型参数,人群移动特征包括个体的所有访问地点及其所属访问地点聚类的访问频率排序间的条件概率分布;将设定的模拟个体数量和目标时长输入人群移动模型后生成目标时长内多个模拟个体的移动轨迹。本申请不仅能够预测个体的移动轨迹,还能够预测个体在访问地点聚类之间和访问地点聚类内部的访问模式,复现访问地点和访问地点聚类之间的内在联系,填补了现有模型的缺陷。

Figure 202310281628

The present application discloses a crowd movement modeling method and device based on visiting location clustering, which belongs to the field of trajectory prediction. The method includes: obtaining the visiting locations and visiting location clustering of multiple individuals in the crowd based on historical crowd moving trajectory data; Crowd movement characteristics are extracted by visiting locations, visiting location clusters and historical crowd movement trajectories, and model parameters of the crowd movement model are generated according to the crowd moving characteristics. The conditional probability distribution between them; the set number of simulated individuals and the target duration are input into the crowd movement model to generate the movement trajectories of multiple simulated individuals within the target duration. This application can not only predict the individual's movement trajectory, but also predict the individual's visit patterns between visit location clusters and within visit location clusters, and reproduce the internal relationship between visit locations and visit location clusters, filling the existing model flaws.

Figure 202310281628

Description

基于访问地点聚类的人群移动建模方法及装置Crowd movement modeling method and device based on visiting location clustering

技术领域technical field

本申请涉及轨迹预测技术领域,特别涉及一种基于访问地点聚类的人群移动建模方法及装置。The present application relates to the technical field of trajectory prediction, in particular to a crowd movement modeling method and device based on clustering of visiting locations.

背景技术Background technique

近几十年,由于通话记录数据、移动设备定位数据、社交媒体打卡数据等人群移动轨迹大数据的可获得性越来越高,人群移动轨迹处理方法和人群移动建模方法快速发展,并在解决城市规划、交通工程、疾病传播控制等实际问题上取得了越来越多的应用。人群移动建模方法能够模拟城市中个体的移动轨迹,并使这些移动轨迹符合经验数据中发现的人群移动在个体层面和群体层面的基本规律,进而复现城市人群移动的关键特征。如果无法精准预测人群的移动规律,复现人群移动的关键特征,就无法为相关政策的合理制定提供准确的参考。In recent decades, due to the increasing availability of big data on crowd movement trajectories such as call record data, mobile device location data, and social media check-in data, crowd movement trajectory processing methods and crowd movement modeling methods have developed rapidly, and It has achieved more and more applications in solving practical problems such as urban planning, traffic engineering, and disease transmission control. The crowd movement modeling method can simulate the movement trajectories of individuals in the city, and make these movement trajectories conform to the basic laws of crowd movement at the individual and group levels found in empirical data, and then reproduce the key characteristics of urban crowd movement. If it is impossible to accurately predict the law of crowd movement and reproduce the key characteristics of crowd movement, it will be impossible to provide accurate reference for the rational formulation of relevant policies.

现有研究中提出了一系列人群移动的细粒度模型,其中,探索及偏好返回(EPR)模型及其变体,如d-EPR模型和PEPR模型,是影响力最大也是最广为使用的人群移动模型。但在现有的人群移动模型中,个体都是在独立的地点间进行移动,这些访问地点的空间分布特征及它们的内在联系则被完全忽略。然而事实上,个体的访问地点会在物理空间中形成聚类,这些访问地点聚类有着丰富的语义特征,它们通常与个体的某类行为相关联并组成了个体日常生活中的各类活动空间(如居住活动空间,工作活动空间,娱乐活动空间等),并影响着个体的移动。以访问地点聚类视角,而不是相互独立的访问地点的视角对人群移动进行建模,一方面可以精准复现个体在访问地点聚类之间及各访问地点聚类内部的移动规律,另一方面也可以将人的移动行为与城市空间建立联系,有助于理解人们与各类城市空间的交互过程。A series of fine-grained models of crowd movement have been proposed in existing studies, among which the Exploration and Preference Return (EPR) model and its variants, such as d-EPR model and PEPR model, are the most influential and widely used crowd models. Mobile model. However, in the existing crowd movement models, individuals move between independent locations, and the spatial distribution characteristics of these visited locations and their internal connections are completely ignored. However, in fact, individuals’ visit locations will form clusters in physical space, and these visit location clusters have rich semantic features, which are usually associated with certain types of behaviors of individuals and constitute various activity spaces in individuals’ daily life (such as residential activity space, work activity space, entertainment activity space, etc.), and affects the movement of individuals. Modeling crowd movement from the perspective of visiting location clusters instead of independent visiting locations can accurately reproduce the movement rules of individuals between visiting location clusters and within each visiting location cluster. On the one hand, it can also establish a connection between people's mobile behavior and urban space, which helps to understand the interaction process between people and various urban spaces.

综上所述,如何将访问地点聚类对个体移动的影响考虑到人群移动建模中是目前人群移动建模方法需要解决的问题之一。To sum up, how to take into account the impact of visiting location clustering on individual movement in crowd movement modeling is one of the problems to be solved in the current crowd movement modeling method.

发明内容Contents of the invention

本申请提供一种基于访问地点聚类的人群移动建模方法及装置,不仅能够预测个体的移动轨迹,还能够预测个体在访问地点聚类之间和访问地点聚类内部的访问模式,复现访问地点和访问地点聚类之间的内在联系,填补了现有模型的缺陷。This application provides a crowd movement modeling method and device based on clustering of visiting locations, which can not only predict the movement trajectory of individuals, but also predict the visiting patterns of individuals between clusters of visiting locations and within clusters of visiting locations. The intrinsic connection between visited places and visited place clusters fills the gaps of existing models.

本申请第一方面实施例提供一种基于访问地点聚类的人群移动建模方法,包括以下步骤:基于历史人群移动轨迹数据获取人群中多个个体的访问地点和访问地点聚类;根据所述访问地点、所述访问地点聚类和所述历史人群移动轨迹提取人群移动特征,并根据所述人群移动特征生成人群移动模型的模型参数,其中,所述人群移动特征包括个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布;将设定的模拟个体数量和目标时长输入所述人群移动模型,通过所述人群移动模型生成所述目标时长内多个模拟个体的移动轨迹。The embodiment of the first aspect of the present application provides a crowd movement modeling method based on clustering of visiting places, including the following steps: acquiring visiting places and visiting place clusters of multiple individuals in the crowd based on historical crowd movement trajectory data; according to the Extracting crowd movement features from the visiting locations, the clustering of the visiting spots, and the historical crowd movement trajectories, and generating model parameters of the crowd movement model according to the crowd movement features, wherein the crowd movement features include all individual visited places The conditional probability distribution between the visiting frequency sorting and the visiting frequency sorting of the visiting location clusters to which it belongs; input the set number of simulated individuals and the target duration into the crowd movement model, and generate the number of people within the target duration through the crowd movement model The trajectory of a simulated individual.

可选地,在本申请的一个实施例中,所述基于历史人群移动轨迹数据获取人群中多个个体的访问地点和访问地点聚类,包括:根据所述历史人群移动轨迹数据确定个体轨迹中的多个位置记录点,计算所述多个位置记录点的当前速度,将当前速度小于等于预设速度阈值的位置记录点标记为停留位置记录点;对所有停留位置记录点进行聚类,计算个体在每个位置记录点聚类中的单次停留时间,根据所述单次停留时间确定个体的访问地点,其中,所述访问地点的坐标为对应位置记录点聚类的中心;对个体的所有访问地点进行聚类,得到所述访问地点聚类。Optionally, in an embodiment of the present application, the acquiring visit locations and visit location clusters of multiple individuals in the crowd based on historical crowd movement trajectory data includes: a plurality of position recording points, calculate the current speed of the plurality of position recording points, and mark the position recording points whose current speed is less than or equal to the preset speed threshold as staying position recording points; cluster all the staying position recording points, and calculate The single stay time of the individual in each location record point cluster, and determine the individual's visit location according to the single stay time, wherein, the coordinates of the visit place are the center of the corresponding location record point cluster; All the visiting locations are clustered to obtain the visiting location clusters.

可选地,在本申请的一个实施例中,根据所述访问地点、所述访问地点聚类和所述历史人群移动轨迹提取人群移动特征,包括:根据个体对各个访问地点的访问顺序及每次访问的到达时间、离开时间,计算每个个体每次访问的停留时间,并统计所有个体的停留时间分布;根据个体对各访问地点的访问顺序,计算每个个体各次访问之间的移动步长,并统计所有个体的移动步长分布;根据个体的历史移动轨迹,统计在不同访问地点数量下个体的探索概率;根据个体的历史移动轨迹,统计在不同访问地点数量下个体的随机探索概率;将个体的所有访问地点和所有访问地点聚类的访问频率进行排序,计算个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布。Optionally, in an embodiment of the present application, extracting crowd movement features according to the visiting places, the clustering of the visiting places and the historical crowd movement trajectory includes: according to the order of individual visits to various visiting places and each According to the arrival time and departure time of each visit, calculate the stay time of each individual visit, and count the stay time distribution of all individuals; according to the order of visits by individuals to each visit location, calculate the movement of each individual visit Step length, and count the distribution of moving steps of all individuals; According to the historical movement trajectory of the individual, the exploration probability of the individual under different numbers of visited locations is counted; According to the historical movement trajectory of the individual, the random exploration of the individual under different numbers of visited locations is counted Probability: sort all the visit locations of the individual and the visit frequencies of all visit location clusters, and calculate the conditional probability distribution between the visit frequency rankings of all the visit locations of the individual and the visit frequency rankings of the visit location clusters to which they belong.

可选地,在本申请的一个实施例中,通过所述人群移动模型生成所述目标时长内多个模拟个体的移动轨迹,包括:选定每个模拟个体的初始位置,并设定初始时刻为0;根据所述人群移动特征提取所述模拟个体的停留时间,并根据所述模拟个体的探索概率进行探索行为或返回行为;在所述模拟个体为探索行为时,根据所述模拟个体的随机探索概率进行随机探索行为或聚类内探索行为;在所述模拟个体进行随机探索行为时,根据所述人群移动特征提取移动步长,并在所述移动步长范围内,确定所述模拟个体在所述访问地点聚类以外的任一探索地点并进行访问;在所述模拟个体进行聚类内探索行为时,根据访问地点聚类的访问频率排序确定所述模拟个体的待访问地点聚类,根据所述人群移动特征提取所述移动步长,并在所述移动步长范围内,确定所述模拟个体在所述待访问地点聚类内的任一探索地点并进行访问;在所述模拟个体为返回行为时,根据访问地点聚类的访问频率确定所述模拟个体的待访问地点聚类,根据所述待访问地点聚类内的访问地点的访问频率确定所述模拟个体的待访问地点并进行访问;计算所述模拟个体在多个访问地点的总停留时间是否大于等于所述目标时长,若所述总停留时间大于等于所述目标时长,则根据所述多个访问地点的位置和访问顺序生成所述模拟个体的移动轨迹,若所述总停留时间小于所述目标时长,则继续根据所述模拟个体的探索概率进行探索行为或返回行为,直至所述总停留时间大于等于所述目标时长。Optionally, in one embodiment of the present application, generating the movement trajectories of multiple simulated individuals within the target duration through the crowd movement model includes: selecting the initial position of each simulated individual, and setting the initial time is 0; the residence time of the simulated individual is extracted according to the movement characteristics of the crowd, and the exploration behavior or return behavior is performed according to the exploration probability of the simulated individual; when the simulated individual is an exploration behavior, according to the simulated individual’s Random exploration probability to perform random exploration behavior or cluster exploration behavior; when the simulated individual performs random exploration behavior, extract the moving step according to the movement characteristics of the crowd, and determine the simulation within the range of the moving step The individual explores and visits any location other than the cluster of visited locations; when the simulated individual performs the exploration behavior within the cluster, the cluster of locations to be visited by the simulated individual is determined according to the order of visit frequency of the cluster of visited locations. class, extract the moving step size according to the movement characteristics of the crowd, and within the range of the moving step size, determine and visit any exploration place of the simulated individual in the cluster of places to be visited; When the simulated individual is in the returning behavior, the cluster of places to be visited by the simulated individual is determined according to the visit frequency of the clustered places to be visited, and the cluster of places to be visited by the simulated individual is determined according to the visit frequency of the places to be visited within the cluster of places to be visited. Visit the location and conduct the visit; calculate whether the total stay time of the simulated individual at multiple visit locations is greater than or equal to the target duration, if the total stay time is greater than or equal to the target duration, then according to the multiple visit locations The location and visit sequence generate the movement track of the simulated individual, if the total stay time is less than the target duration, continue to explore or return according to the exploration probability of the simulated individual until the total stay time is greater than or equal to The target duration.

本申请第二方面实施例提供一种基于访问地点聚类的人群移动建模装置,包括:获取模块,用于基于历史人群移动轨迹数据获取人群中多个个体的访问地点和访问地点聚类;参数生成模块,用于根据所述访问地点、所述访问地点聚类和所述历史人群移动轨迹提取人群移动特征,并根据所述人群移动特征生成人群移动模型的模型参数,其中,所述人群移动特征包括个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布;轨迹生成模块,用于将设定的模拟个体数量和目标时长输入所述人群移动模型,通过所述人群移动模型生成所述目标时长内多个模拟个体的移动轨迹。The embodiment of the second aspect of the present application provides a crowd movement modeling device based on clustering of visiting places, including: an acquisition module, configured to acquire visiting places and visiting place clusters of multiple individuals in the crowd based on historical crowd movement trajectory data; A parameter generating module, configured to extract crowd movement characteristics according to the visited locations, clusters of the visited locations, and the historical crowd movement trajectories, and generate model parameters of a crowd movement model according to the crowd movement characteristics, wherein the crowd The movement characteristics include the conditional probability distribution between the visit frequency rankings of all the visiting places of the individual and the visiting frequency rankings of the visiting place clusters to which they belong; the trajectory generation module is used to input the set number of simulated individuals and target duration into the crowd movement model, using the crowd movement model to generate movement trajectories of multiple simulated individuals within the target duration.

可选地,在本申请的一个实施例中,所述获取模块,包括:识别单元,用于根据所述历史人群移动轨迹数据确定个体轨迹中的多个位置记录点,计算所述多个位置记录点的当前速度,将当前速度小于等于预设速度阈值的位置记录点标记为停留位置记录点;第一聚类单元,用于对所有停留位置记录点进行聚类,计算个体在每个位置记录点聚类中的单次停留时间,根据所述单次停留时间确定个体的访问地点,其中,所述访问地点的坐标为对应位置记录点聚类的中心;第二聚类单元,用于对个体的所有访问地点进行聚类,得到所述访问地点聚类。Optionally, in an embodiment of the present application, the acquisition module includes: an identification unit, configured to determine a plurality of position recording points in the individual trajectory according to the historical crowd movement trajectory data, and calculate the plurality of position record points Record the current speed of the point, and mark the position record point whose current speed is less than or equal to the preset speed threshold as the stay position record point; the first clustering unit is used to cluster all the stay position record points, and calculate the individual’s time at each position A single stay time in the record point cluster, according to the single stay time to determine the individual's visit location, wherein the coordinates of the visit place is the center of the corresponding position record point cluster; the second clustering unit is used for All the visit places of the individual are clustered to obtain the visit place clusters.

可选地,在本申请的一个实施例中,所述参数生成模块,包括:第一提取单元,用于根据个体对各个访问地点的访问顺序及每次访问的到达时间、离开时间,计算每个个体每次访问的停留时间,并统计所有个体的停留时间分布;第二提取单元,用于根据个体对各访问地点的访问顺序,计算每个个体各次访问之间的移动步长,并统计所有个体的移动步长分布;第三提取单元,用于根据个体的历史移动轨迹,统计在不同访问地点数量下个体的探索概率;第四提取单元,用于根据个体的历史移动轨迹,统计在不同访问地点数量下个体的随机探索概率;第五提取单元,用于将个体的所有访问地点和所有访问地点聚类的访问频率进行排序,计算个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布。Optionally, in one embodiment of the present application, the parameter generation module includes: a first extraction unit, configured to calculate the time of each visit according to the individual's visit sequence to each visit location and the arrival time and departure time of each visit. The residence time of each individual visit, and count the distribution of the residence time of all individuals; the second extraction unit is used to calculate the moving step size between each visit of each individual according to the order of visits by individuals to each visit location, and The distribution of moving steps of all individuals is counted; the third extraction unit is used to count the exploration probability of individuals under different numbers of visited locations according to the historical moving trajectory of the individual; the fourth extraction unit is used to calculate the statistical The random exploration probability of individuals under different numbers of visiting locations; the fifth extraction unit is used to sort all the visiting locations of the individual and the visiting frequencies of all visiting locations clustered, and calculate the ranking of the visiting frequencies of all visiting locations of the individual and their belonging Conditional probability distribution among visit frequency orderings for clusters of visit locations.

可选地,在本申请的一个实施例中,所述轨迹生成模块,包括:初始单元,用于选定每个模拟个体的初始位置,并设定初始时刻为0;探索单元,用于根据所述人群移动特征提取所述模拟个体的停留时间,并根据所述模拟个体的探索概率进行探索行为或返回行为;所述探索单元,还用于在所述模拟个体为探索行为时,根据所述模拟个体的随机探索概率进行随机探索行为或聚类内探索行为;所述探索单元,还用于在所述模拟个体进行随机探索行为时,根据所述人群移动特征提取移动步长,并在所述移动步长范围内,确定所述模拟个体在所述访问地点聚类以外的任一探索地点并进行访问;所述探索单元,还用于在所述模拟个体进行聚类内探索行为时,根据访问地点聚类的访问频率排序确定所述模拟个体的待访问地点聚类,根据所述人群移动特征提取所述移动步长,并在所述移动步长范围内,确定所述模拟个体在所述待访问地点聚类内的任一探索地点并进行访问;所述探索单元,还用于在所述模拟个体为返回行为时,根据访问地点聚类的访问频率确定所述模拟个体的待访问地点聚类,根据所述待访问地点聚类内的访问地点的访问频率确定所述模拟个体的待访问地点并进行访问;输出单元,用于计算所述模拟个体在多个访问地点的总停留时间是否大于等于所述目标时长,若所述总停留时间大于等于所述目标时长,则根据所述多个访问地点的位置和访问顺序生成所述模拟个体的移动轨迹,若所述总停留时间小于所述目标时长,则继续根据所述模拟个体的探索概率进行探索行为或返回行为,直至所述总停留时间大于等于所述目标时长。Optionally, in one embodiment of the present application, the trajectory generation module includes: an initial unit, configured to select the initial position of each simulated individual, and set the initial moment to 0; an exploration unit, configured to The crowd movement feature extracts the residence time of the simulated individual, and performs exploration behavior or return behavior according to the exploration probability of the simulated individual; the exploration unit is also used to The random exploration probability of the simulated individual performs random exploration behavior or cluster exploration behavior; the exploration unit is also used to extract the moving step according to the movement characteristics of the crowd when the simulated individual performs random exploration behavior, and Within the range of the moving step, it is determined that the simulated individual is at any exploration location outside the cluster of the visited locations and visits; the exploration unit is also used for when the simulated individual performs an exploration behavior within the cluster , determine the location clusters to be visited by the simulated individuals according to the visit frequency ranking of the visited location clusters, extract the moving step size according to the movement characteristics of the crowd, and determine the simulated individual within the range of the moving step size Explore and visit any location within the cluster of locations to be visited; the exploration unit is further configured to determine the simulated individual's visit frequency according to the visit frequency of the visited location cluster when the simulated individual is returning. Clustering of places to be visited, determining and visiting places to be visited by the simulated individual according to the visit frequency of the places to be visited in the cluster of places to be visited; an output unit for calculating the number of places to be visited by the simulated individual in multiple places to visit Whether the total stay time is greater than or equal to the target duration, if the total stay time is greater than or equal to the target duration, then generate the movement trajectory of the simulated individual according to the positions and visit sequences of the multiple visit locations, if the total stay time is greater than or equal to the target duration If the stay time is less than the target duration, continue to perform exploration or return behavior according to the exploration probability of the simulated individual until the total stay time is greater than or equal to the target duration.

本申请实施例的基于访问地点聚类的人群移动建模方法及装置,通过从个体移动轨迹中识别访问地点聚类,利用个体在访问地点聚类层面的移动行为得到人群移动特征,除停留时间、移动步长、个体的探索概率随访问地点数量的变化情况、个体的随机探索概率随访问地点数量的变化情况以外,本申请还创新性地考虑了个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布,通过考虑该条件概率分布,本申请反映了访问地点及访问地点聚类之间的内在联系;刻画了个体移动的两个重要特征:属于高频访问地点聚类的访问地点有更高的概率被个体访问,具有较高访问频率的访问地点更可能出现高频访问地点聚类中;并证明了现有模型在刻画访问地点和其所属访问地点聚类的关联关系上有明显不足。进一步引入了个体移动在访问地点聚类视角下的两阶段决策过程,不仅能够预测个体的移动轨迹,还能够预测个体在访问地点聚类之间和访问地点聚类内部的访问模式,复现访问地点和访问地点聚类之间的内在联系,填补了现有模型的缺陷。The crowd movement modeling method and device based on clustering of visited places in the embodiment of the present application, by identifying visiting place clusters from individual movement trajectories, using individual movement behaviors at the visiting place clustering level to obtain crowd movement characteristics, except for stay time , moving step size, the variation of the individual’s exploration probability with the number of visited locations, and the variation of the individual’s random exploration probability with the number of visited locations, this application also innovatively considers the ordering of the individual’s visit frequencies of all visited locations and their The conditional probability distribution among the visit frequency rankings of the visiting place clusters, by considering the conditional probability distribution, this application reflects the internal connection between the visiting places and the visiting place clusters; it describes two important characteristics of individual movement: belonging to Visiting locations clustered with high-frequency visiting locations have a higher probability of being visited by individuals, and visiting locations with higher visiting frequencies are more likely to appear in high-frequency visiting location clusters; There are obvious deficiencies in the association relationship of visiting location clustering. It further introduces a two-stage decision-making process of individual movement from the perspective of visiting location clusters, which can not only predict the individual's movement trajectory, but also predict the individual's access patterns between visiting location clusters and within visiting location clusters, recurring visits Intrinsically linking clusters of places and visited places fills the gaps of existing models.

本申请附加的方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本申请的实践了解到。Additional aspects and advantages of the application will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the application.

附图说明Description of drawings

本申请上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present application will become apparent and easy to understand from the following description of the embodiments in conjunction with the accompanying drawings, wherein:

图1为根据本申请实施例提供的一种基于访问地点聚类的人群移动建模方法的流程图;FIG. 1 is a flow chart of a crowd movement modeling method based on visiting location clustering provided according to an embodiment of the present application;

图2为根据本申请实施例提供的获取人群中多个个体的访问地点和访问地点聚类的流程图;FIG. 2 is a flow chart of obtaining visiting locations and visiting location clusters of multiple individuals in a crowd according to an embodiment of the present application;

图3为根据本申请实施例提供的提取人群移动特征的流程图;FIG. 3 is a flow chart of extracting crowd movement features according to an embodiment of the present application;

图4为根据本申请实施例提供的个体移动轨迹模拟方法的流程图;FIG. 4 is a flow chart of a method for simulating individual movement trajectories according to an embodiment of the present application;

图5为根据本申请实施例提供的人群移动模型示意图;FIG. 5 is a schematic diagram of a crowd movement model provided according to an embodiment of the present application;

图6为根据本申请实施例的基于访问地点聚类的人群移动建模装置的示例图。Fig. 6 is an example diagram of a crowd movement modeling device based on visiting place clustering according to an embodiment of the present application.

具体实施方式Detailed ways

下面详细描述本申请的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本申请,而不能理解为对本申请的限制。Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary, and are intended to explain the present application, and should not be construed as limiting the present application.

图1为根据本申请实施例提供的一种基于访问地点聚类的人群移动建模方法的流程图。Fig. 1 is a flow chart of a crowd movement modeling method based on visiting place clustering according to an embodiment of the present application.

如图1所示,该基于访问地点聚类的人群移动建模方法包括以下步骤:As shown in Figure 1, the crowd movement modeling method based on visiting location clustering includes the following steps:

在步骤S101中,基于历史人群移动轨迹数据获取人群中多个个体的访问地点和访问地点聚类。In step S101 , based on historical crowd movement track data, the visiting locations and visiting location clusters of multiple individuals in the crowd are obtained.

历史人群移动轨迹数据可以为特定的城市在过去一段时间内的人群移动数据,人群移动轨迹数据中包括了个体去的地方以及到达和离开时间等。根据人群移动轨迹数据,可以从中获取到人群中多个个体的访问地点。The historical crowd movement trajectory data can be the crowd movement data of a specific city in the past period of time. The crowd movement trajectory data includes the places where individuals go and the time of arrival and departure. According to the movement track data of the crowd, the visiting locations of multiple individuals in the crowd can be obtained therefrom.

在本申请的一个实施例中,如图2所示,作为一种获取人群中多个个体的访问地点和访问地点聚类的方式,包括:In one embodiment of the present application, as shown in FIG. 2 , as a way of obtaining the visiting locations and visiting location clusters of multiple individuals in the crowd, it includes:

S1011,根据历史人群移动轨迹数据确定个体轨迹中的多个位置记录点,计算多个位置记录点的当前速度,将当前速度小于等于预设速度阈值的位置记录点标记为停留位置记录点。S1011. Determine multiple position record points in the individual trajectory according to the historical crowd movement track data, calculate the current speed of the multiple position record points, and mark the position record points whose current speed is less than or equal to the preset speed threshold as stay position record points.

具体地,本申请的实施例可以提取历史人群移动轨迹数据中每个个体的轨迹,识别个体的轨迹中为停留状态的位置记录点,其中,个体的轨迹由多个带有时间戳的位置记录点组成,位置记录点的状态包括停留状态和移动状态。Specifically, the embodiments of the present application can extract the trajectory of each individual in the historical crowd movement trajectory data, and identify the position record points in the individual's trajectory that are in the stay state, wherein the individual's trajectory is recorded by multiple locations with time stamps Composed of points, the state of the location record point includes the stay state and the moving state.

将个体的轨迹定义为,定义是带有时间戳的位置记录。位置记录的状态可以分为停留状态和移动状态,识别中属于停留状态的位置记录,识别方法如下:Define the trajectory of an individual as ,definition is a timestamped location record. The state of the location record can be divided into the stay state and the moving state. The location records belonging to the stay state in , the identification method is as follows:

设定一个速度阈值(推荐取值为1.3m/s),从第一个位置记录点向后扫描,当位置记录点满足时,该位置记录点的状态为停留状态,反之则为移动状态。set a speed threshold (The recommended value is 1.3m/s), scan backward from the first position record point, when the position record point satisfy , the state of the location record point is the stay state, otherwise it is the moving state.

S1012,对所有停留位置记录点进行聚类,计算个体在每个位置记录点聚类中的单次停留时间,根据单次停留时间确定个体的访问地点,其中,访问地点的坐标为对应位置记录点聚类的中心。S1012, cluster all the stay location record points, calculate the single stay time of the individual in each location record point cluster, and determine the individual visit location according to the single stay time, wherein the coordinates of the visit location are the corresponding location records Center of point clustering.

具体地,本申请的实施例可以利用第一聚类算法对所有停留状态的位置记录点进行聚类,统计个体在每个位置记录点聚类中的单次最大停留时间,将单次最大停留时间大于预设时长的位置记录点聚类作为个体的访问地点,访问地点的坐标为对应位置记录点聚类的中心。Specifically, the embodiment of the present application can use the first clustering algorithm to cluster all location record points in the stay state, count the single maximum stay time of individuals in each location record point cluster, and divide the single maximum stay The location record points whose time is longer than the preset duration are clustered as individual visit locations, and the coordinates of the visit locations are the centers of the corresponding location record point clusters.

第一聚类算法可以使用DBSCAN(Density-Based Spatial Clustering ofApplications with Noise)算法,该算法为一种基于密度的聚类算法。The first clustering algorithm can use the DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm, which is a density-based clustering algorithm.

给定最大扫描半径和最小样本数量(推荐取值分别为50米和2个),使用第一聚类算法对给定个体的所有属于停留状态的位置记录进行聚类。根据聚类结果,统计给定个体在每个位置记录点聚类中的单次最大停留时间。保留所有单次最大停留时间大于预设时长(可以设定为5分钟)的位置记录点聚类,剩余的每个位置记录点聚类代表给定个体的一个访问地点,访问地点的坐标为对应位置记录点聚类的中心。Given a maximum scanning radius and a minimum number of samples (recommended values are 50 m and 2 samples, respectively), the first clustering algorithm is used to cluster all location records belonging to the stay state for a given individual. According to the clustering results, the single maximum stay time of a given individual in each location record point cluster is counted. Keep all location record point clusters with a single maximum stay time greater than the preset duration (can be set to 5 minutes), and each remaining location record point cluster represents a visit location of a given individual, and the coordinates of the visit location are corresponding The center of the location record point cluster.

根据个体的轨迹及聚类结果,个体轨迹可表示为一系列带有时间戳的访问地点记录,进一步地,可以确定给定个体对一系列访问地点的到达时间、离开时间及访问顺序。According to individual trajectories and clustering results, individual trajectories can be expressed as a series of time-stamped visit location records, and further, the arrival time, departure time and visit order of a series of visit locations for a given individual can be determined.

S1013,对个体的所有访问地点进行聚类,得到所述访问地点聚类。S1013. Cluster all the visit locations of the individual to obtain the visit location clusters.

本申请的实施例可以利用第二聚类算法对个体的所有访问地点进行聚类,得到访问地点聚类。In the embodiment of the present application, the second clustering algorithm may be used to cluster all the visited places of the individual to obtain the visited place clusters.

第二聚类算法可以为HDBSCAN(Hierarchical Density-Based SpatialClustering of Applications with Noise)算法,该算法为一种基于DBSCAN算法的层次聚类算法。The second clustering algorithm can be the HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise) algorithm, which is a hierarchical clustering algorithm based on the DBSCAN algorithm.

给定聚类提取参数和最小样本数量(推荐取值分别为1千米和2个),使用第二聚类算法对给定个体的所有访问地点进行聚类,得到给定个体的若干访问地点聚类,根据访问地点的聚类结果,可以确定给定个体的各访问地点与各访问聚类的从属关系。Given the cluster extraction parameters and the minimum number of samples (recommended values are 1 km and 2), use the second clustering algorithm to cluster all the visit locations of a given individual, and obtain several visit locations of a given individual Clustering, according to the clustering results of visiting places, the affiliation relationship between each visiting place and each visiting cluster of a given individual can be determined.

在步骤S102中,根据访问地点、访问地点聚类和历史人群移动轨迹提取人群移动特征,并根据人群移动特征生成人群移动模型的模型参数,其中,人群移动特征包括个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布。In step S102, crowd movement characteristics are extracted according to the visiting locations, visiting location clusters, and historical crowd movement trajectories, and model parameters of the crowd movement model are generated according to the crowd movement characteristics, wherein the crowd movement characteristics include the visit frequencies of all the individual visiting locations The conditional probability distribution between the ranks and the visit frequency ranks of the visited place clusters to which they belong.

基于特定城市的人群移动轨迹数据得到了一系列个体的访问地点和访问地点聚类,根据访问地点、访问地点聚类以及人群移动轨迹数据可以提取出人群移动特征,进而根据该特征确定人群移动模型的多个参数。其中,人群移动特征可以包括停留时间、移动步长、探索概率随访问地点数量的变化情况、随机探索概率与探索概率的比值随访问地点数量的变化情况、访问地点的访问频率排序与其所属访问地点聚类的访问频率排序的关系等。A series of individual visit locations and visit location clusters are obtained based on the crowd movement trajectory data of a specific city. According to the visit locations, visit location clusters and crowd movement trajectory data, the crowd movement characteristics can be extracted, and then the crowd movement model can be determined according to the characteristics. multiple parameters. Among them, the crowd movement characteristics can include stay time, moving step length, the change of exploration probability with the number of visited places, the change of the ratio of random exploration probability to explored probability with the number of visited places, the order of visit frequency of visited places and the visited places to which they belong. Clustering access frequency sorting relationship, etc.

本申请的一个实施例中,作为一种提取人群移动特征的方式,如图3所示,根据访问地点、访问地点聚类和历史人群移动轨迹提取人群移动特征,包括:In one embodiment of the present application, as a method of extracting crowd movement features, as shown in Figure 3, crowd movement features are extracted according to visiting locations, visiting location clusters, and historical crowd movement trajectories, including:

S1021,根据个体对各个访问地点的访问顺序及每次访问的到达时间、离开时间,计算每个个体每次访问的停留时间,并统计所有个体的停留时间分布。S1021. Calculate the stay time of each individual for each visit according to the visit order of the individual to each visit location and the arrival time and departure time of each visit, and count the stay time distribution of all individuals.

S1022,根据个体对各访问地点的访问顺序,计算每个个体各次访问之间的移动步长,并统计所有个体的移动步长分布。S1022. According to the individual's visit sequence to each visit location, calculate the movement step size between each visit of each individual, and count the movement step length distribution of all individuals.

S1023,根据个体的历史移动轨迹,统计在不同访问地点数量下个体的探索概率。S1023, according to the historical movement trajectory of the individual, calculate the exploration probability of the individual under different numbers of visited locations.

探索概率随访问地点数量S的变化情况,其统计方法如下:定义个体的每次访问行为分为探索行为及返回行为,当个体访问一个从未去过的访问地点时称该访问行为为探索行为;反之,当个体访问一个曾去过的当问地点时称该访问行为为返回行为。explore probability With the change of the number of visited places S, the statistical method is as follows: each visit behavior of an individual is defined as an exploration behavior and a return behavior. When an individual visits a place that has never been visited before, the visit behavior is called an exploration behavior; , when an individual visits a previously visited location, the visit behavior is called return behavior.

当个体当前已经去过的访问地点数量为S时,定义个体在下次访问时进行探索行为的概率为该个体在访问地点数量为S时的探索概率When the number of places visited by the individual is S, define the probability that the individual will perform exploration behavior in the next visit as the exploration probability of the individual when the number of places visited is S .

对于每个个体,根据其历史移动轨迹,统计在给定不同访问地点数量S时,该个体的探索概率的取值。For each individual, according to its historical movement trajectory, the exploration probability of the individual given the number S of different visited places is counted value of .

S1024,根据个体的历史移动轨迹,统计在不同访问地点数量下个体的随机探索概率。S1024, according to the individual's historical movement trajectory, calculate the random exploration probability of the individual under different numbers of visited locations.

随机探索概率与探索概率的比值随访问地点数量S的变化情况,其统计方法如下:定义个体的每次探索行为分为随机探索行为及聚类内探索行为,当个体在进行探索行为时所访问的访问地点不属于当前的任何一个访问地点聚类时,称该探索行为为随机探索行为;反之,当个体在进行探索行为时所访问的访问地点属于当前的某一个访问地点聚类时,称该探索行为为聚类内探索行为。Random Exploration Probability The ratio of the ratio to the exploration probability varies with the number of visited locations S. The statistical method is as follows: each individual exploration behavior is defined as random exploration behavior and cluster exploration behavior. When it does not belong to any of the current visiting place clusters, the exploration behavior is called random exploring behavior; on the contrary, when the visiting place visited by the individual during the exploring behavior belongs to a certain current visiting place cluster, it is called the exploring behavior Behavior for exploring within clusters.

当个体当前已经去过的访问地点数量为S时,定义个体在下次访问进行探索行为时选择随机探索行为的概率为该个体在访问地点数量为S时的随机探索概率When the number of places visited by the individual is S, define the probability that the individual chooses random exploration behavior in the next visit to explore behavior as the random exploration probability of the individual when the number of visited places is S .

对于每个个体,根据其历史移动轨迹,统计在给定不同访问地点数量S时,该个体的随机探索概率的取值。For each individual, according to its historical movement trajectory, the random exploration probability of the individual given the number S of different visiting places is counted value of .

S1025,将个体的所有访问地点和所有访问地点聚类的访问频率进行排序,计算个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布。S1025. Sorting all the visit locations of the individual and the visit frequencies of all the visit location clusters, and calculating the conditional probability distribution between the visit frequency rankings of all the visit locations of the individual and the visit frequency rankings of the visit location clusters to which they belong.

访问地点的访问频率排序与其所属访问地点聚类的访问频率排序的关系,其统计方法如下:定义个体对某一访问地点的访问次数为该访问地点的访问频率;将所有访问地点按照它们的访问频率由高到低排序,定义某一访问地点的排序为该访问地点的访问频率排序The relationship between the visit frequency ranking of a visit location and the visit frequency ranking of the visit location cluster to which it belongs, the statistical method is as follows: Define the number of visits an individual makes to a visit location as the visit frequency of the visit location ;Sort all the visiting places according to their visiting frequency from high to low, define the sorting of a certain visiting place as the visiting frequency sorting of the visiting place .

定义个体对某一访问地点聚类中所有访问地点的访问次数之和为该访问地点聚类的访问频率;将所有访问地点聚类按照它们的访问频率由高到低排序,定义某一访问地点聚类的排序为该访问地点聚类的访问频率排序Define the sum of visit times of individuals to all visit places in a visit place cluster as the visit frequency of the visit place cluster ;Sort all the visit location clusters according to their visit frequency from high to low, define the sort of a visit location cluster as the visit frequency sort of the visit location cluster .

统计每个体所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序,计算条件概率分布Count the visit frequency sorting of all visit places for each individual and the visit frequency ranking of the cluster of visit places to which it belongs , computing the conditional probability distribution .

本申请实施例的人群移动特征创新性地考虑了个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布,该分布可以反映访问地点及访问地点聚类之间的内在联系;刻画个体移动的两个重要特征:属于高频访问地点聚类的访问地点有更高的概率被个体访问,具有较高访问频率的访问地点更可能出现高频访问地点聚类中;并证明了现有模型在刻画访问地点和其所属访问地点聚类的关联关系上有明显不足。The crowd movement feature in the embodiment of the present application innovatively considers the conditional probability distribution between the visit frequency rankings of all the visiting places of an individual and the visiting frequency rankings of the visiting place clusters to which they belong. This distribution can reflect the visiting places and the visiting place clusters Two important characteristics that characterize the movement of individuals: Visiting places that belong to the cluster of high-frequency visiting places have a higher probability of being visited by individuals, and visiting places with higher visiting frequency are more likely to have clusters of high-frequency visiting places. class; and it is proved that the existing models have obvious deficiencies in describing the association relationship between the visiting places and the visiting place clusters to which they belong.

基于上述提取出的特征,根据下述方法确定模型参数的数值:Based on the features extracted above, the model parameters are determined according to the following method value of:

基于停留时间的分布情况,拟合分布的数值;based on dwell time distribution, fitting the distribution middle value;

基于停留时间的分布情况,拟合分布的数;based on dwell time distribution, fitting the distribution middle number of

基于访问地点数量S与探索概率的关系,拟合等式的数值;Based on the number of visited locations S and the exploration probability relationship, fitting equation middle and value;

基于访问地点数量S与随机探索概率的关系,拟合等式的数值;Based on the number of visited locations S and random exploration probability relationship, fitting equation middle value;

基于访问地点的访问频率排序及其所属访问地点聚类的访问频率排序的条件概率分布,参数的数值的拟合方法如下:Sort by visit frequency based on visit location and the visit frequency ranking of the cluster of visit places to which it belongs The conditional probability distribution of ,parameter and The numerical fitting method of is as follows:

给定访问地点的访问频率排序的取值为,计算概率分布Ranking of frequency of visits for a given place visited The value is , to calculate the probability distribution ;

拟合在给定不同的rl时,指数分布中m的数值;Fitting the exponential distribution given different r l The value of m in;

基于和m的关系,拟合等式的数值。based on and m relationship, fitting equation middle and value.

在步骤S103中,将设定的模拟个体数量和目标时长输入人群移动模型,通过人群移动模型生成目标时长内多个模拟个体的移动轨迹。In step S103, the set number of simulated individuals and the target duration are input into the crowd movement model, and the movement trajectories of multiple simulated individuals within the target duration are generated through the crowd movement model.

可选地,在本申请的一个实施例中,通过人群移动模型生成目标时长内多个模拟个体的移动轨迹,包括:Optionally, in one embodiment of the present application, the movement trajectories of multiple simulated individuals within the target duration are generated through the crowd movement model, including:

选定每个模拟个体的初始位置,并设定初始时刻为0;Select the initial position of each simulated individual, and set the initial moment to 0;

根据人群移动特征提取模拟个体的停留时间,并根据模拟个体的探索概率进行探索行为或返回行为;Extract the residence time of the simulated individual according to the movement characteristics of the crowd, and perform exploration behavior or return behavior according to the exploration probability of the simulated individual;

在模拟个体为探索行为时,根据模拟个体的随机探索概率进行随机探索行为或聚类内探索行为;When the simulated individual is exploring behavior, perform random exploration behavior or cluster exploration behavior according to the random exploration probability of the simulated individual;

在模拟个体进行随机探索行为时,根据人群移动特征提取移动步长,并在移动步长范围内,确定模拟个体在访问地点聚类以外的任一探索地点并进行访问;When simulating an individual’s random exploration behavior, extract the moving step according to the movement characteristics of the crowd, and within the range of the moving step, determine the simulated individual’s visit to any exploration location outside the cluster of visiting locations;

在模拟个体进行聚类内探索行为时,根据访问地点聚类的访问频率排序确定模拟个体的待访问地点聚类,根据人群移动特征提取移动步长,并在移动步长范围内,确定模拟个体在待访问地点聚类内的任一探索地点并进行访问;When simulating the individual’s exploration behavior within the cluster, determine the location cluster to be visited by the simulating individual according to the visit frequency ranking of the visiting location cluster, extract the moving step according to the movement characteristics of the crowd, and determine the simulating individual within the range of the moving step Explore and visit any place within the cluster of places to visit;

在模拟个体为返回行为时,根据访问地点聚类的访问频率确定模拟个体的待访问地点聚类,根据待访问地点聚类内的访问地点的访问频率确定模拟个体的待访问地点并进行访问;When the simulated individual is in the return behavior, determine the location to be visited cluster of the simulated individual according to the visit frequency of the visited location cluster, and determine and visit the location to be visited by the simulated individual according to the visit frequency of the visited locations within the cluster of places to be visited;

计算模拟个体在多个访问地点的总停留时间是否大于等于目标时长,若总停留时间大于等于目标时长,则根据多个访问地点的位置和访问顺序生成模拟个体的移动轨迹,若总停留时间小于目标时长,则继续根据模拟个体的探索概率进行探索行为或返回行为,直至总停留时间大于等于目标时长。Calculate whether the total stay time of the simulated individual in multiple visit locations is greater than or equal to the target duration, if the total stay time is greater than or equal to the target duration, then generate the movement trajectory of the simulated individual according to the positions and visit sequences of multiple visit locations, if the total stay time is less than Target duration, then continue to explore or return according to the exploration probability of the simulated individual until the total stay time is greater than or equal to the target duration.

如图4所示,展示了生成一定数量虚拟个体在一段时间内的移动轨迹的方法的流程。具体操作步骤如下:As shown in FIG. 4 , the flow of the method for generating the moving trajectories of a certain number of virtual individuals within a period of time is shown. The specific operation steps are as follows:

设定拟生成移动轨迹的个体数量和目标时长:Set the number of individuals and the target duration of the movement trajectory to be generated:

对于每个虚拟个体,在选定城市中随机选取一个位置点作为该个体的初始位置;For each virtual individual, randomly select a location point in the selected city as the initial location of the individual;

设定初始时刻为0,重复下述步骤,直到该个体的总停留时间超过目标时长;Set the initial time to 0, repeat the following steps until the total stay time of the individual exceeds the target time;

步骤A:从分布中提取停留时间Step A: From the distribution extraction dwell time ;

步骤B:个体以的概率进行探索行为(转到步骤C),否则,则进行返回行为(转到步骤F);Step B: Individuals take Probability to perform exploration behavior (go to step C), otherwise, perform return behavior (go to step F);

步骤C:个体进行探索行为时,以的概率进行随机探索行为(转到第D步);否则,则进行聚类内探索行为(转到第E步);Step C: When the individual performs exploratory behavior, the Probability to perform random exploration (go to step D); otherwise, perform intra-cluster exploration (go to step E);

步骤D:个体进行随机探索行为时,首先从分布提取移动步长;然后,令个体从以当前位置为圆心,为半径的圆弧上随机选择一个不属于任何一个现有访问地点聚类的地点进行访问;如果在圆弧上不存在任何满足上述要求的地点,则重新生成移动步长并重复步骤D;Step D: When the individual performs random exploration behavior, first from the distribution Extract the move step size ; Then, let the individual start from the current position as the center of the circle, Randomly select a location on the arc of the radius that does not belong to any existing cluster of visited locations to visit; if there is no location on the arc that meets the above requirements, regenerate the moving step and repeat step D;

步骤E:个体进行聚类内探索行为时,以概率选择探索访问频率排序为的访问地点聚类i;在确定要探索的访问地点聚类之后,从分布提取移动步长;然后,令个体从以当前位置为圆心,为半径的圆弧上随机选择一个属于所选聚类的地点进行访问。如果在圆弧上不存在任何满足上述要求的地点,则重新生成移动步长并重复步骤E;Step E: When the individual performs the exploration behavior in the cluster, the probability Choose to explore access frequency sorting cluster i for the visiting locations; after determining the visiting location clusters to be explored, from the distribution Extract the move step size ; Then, let the individual start from the current position as the center of the circle, Randomly select a location on an arc of radius that belongs to the selected cluster to visit. If there is no place on the arc that satisfies the above requirements, regenerate the move step and repeat step E;

步骤F:个体进行返回行为时,首先依据所有访问地点聚类的访问频率选择将要返回的访问地点聚类,返回概率正比于各访问地点聚类的访问频率;然后,根据该访问地点聚类内所有访问地点的访问频率决定将要返回的访问地点,返回概率正比于各访问地点的访问频率。Step F: When the individual performs the return behavior, first select the visiting location cluster to be returned according to the visiting frequency of all visiting location clusters, and the return probability is proportional to the visiting frequency of each visiting location cluster; then, according to the visiting location clustering The visit frequencies of all visit locations determine the visit locations to be returned, and the return probability is proportional to the visit frequency of each visit location.

本申请引入了个体移动在访问地点聚类视角下的两阶段决策过程,即个体首先在聚类层面进行移动决策以决定访问哪个访问地点聚类,然后在访问地点层面进行移动决策以决定访问哪个具体的访问地点。如图5所示,在人群移动模型中,访问地点聚类对个体移动的影响同时存在于偏好返回阶段和探索阶段。在偏好返回阶段,个体首先根据所有访问地点聚类的访问频率决定要访问哪个访问地点聚类,再根据该聚类中所有访问地点的访问频率决定访问哪个具体的访问地点:这一机制体现了组成访问地点聚类的访问地点的吸引力影响了访问地点聚类的返回概率,此外,个体对某一访问地点的访问频率不仅受其自身的吸引力的影响,也受其所属访问地点聚类的吸引力的影响,换言之,其附近其他访问地点的吸引力的影响;相似地,在探索过程中,个体首先决定要在哪个聚类内进行探索(或选择在所有已存在的聚类之外进行探索),然后再根据移动步长在选定聚类内决定要探索的新地点:这一机制意味着个体对探索的新地点的选择并不像EPR模型中假设的那样是完全随机的,而是受到已存在的访问地点聚类的影响。This application introduces a two-stage decision-making process for individual movement from the perspective of visiting location clustering, that is, individuals first make a movement decision at the cluster level to decide which visiting location cluster to visit, and then make a moving decision at the visiting location level to decide which one to visit Specific places to visit. As shown in Figure 5, in the crowd mobility model, the influence of visit location clustering on individual mobility exists in both the preference return phase and the exploration phase. In the preference return stage, the individual first decides which visit location cluster to visit according to the visit frequency of all visit place clusters, and then decides which specific visit place to visit according to the visit frequency of all visit place clusters in this cluster: this mechanism reflects The attractiveness of the visiting locations that make up the visiting location cluster affects the return probability of the visiting location cluster. In addition, the individual’s visit frequency to a visiting location is not only affected by its own attractiveness, but also by the visiting location cluster to which it belongs. In other words, the influence of the attractiveness of other nearby visited places; similarly, in the exploration process, the individual first decides which cluster to explore in (or chooses to stay outside all existing clusters explore), and then decide new places to explore within the selected cluster according to the moving step size: this mechanism means that the individual's choice of new places to explore is not completely random as assumed in the EPR model, Rather, it is affected by the existing clustering of visited locations.

在测试中,本模型在广州市和美国休斯顿市的两个人群移动轨迹数据集上进行了测试,当在广州市数据集上进行测试时,参考广州市数据集涵盖的时间长度和样本数量,拟生成移动轨迹的个体数量设定为17000个,目标时长设定为5个月。基于广州市数据集的参数拟合结果,本模型中的相关参数设定如下:β=0.618,ρ=0.697,γ=0.199,α=0.615,τ=0.595,ω=-2.848,ε=0.578;当在休斯顿数据集上进行测试时,参考休斯顿数据集涵盖的时间长度和样本数量,拟生成移动轨迹的个体数量设定为17000个,目标时长设定为50周。基于休斯顿市数据集的参数拟合结果,本模型中的相关参数设定如下:β=0.800,ρ=0.647,γ=0.273,α=0.416,τ=0.606,ω=-3.572,ε=0.819。基于上述测试,可以分别得到两个城市的人类移动轨迹中访问地点和其所属聚类的访问频率排序的关联关系,通过对比数据集和本模型生成的人类移动轨迹中访问地点和其所属聚类的访问频率排序的关联关系,可以发现,本模型生成的访问地点和其所属聚类的访问频率排序的关联关系与数据集所呈现出的特征较为一致:即属于高频访问地点聚类的访问地点具有较高的访问频率,且具有较高访问频率的访问地点更可能出现高频访问地点聚类中。而现有的EPR模型在刻画访问地点和其所属聚类的访问频率排序的关联关系上则有明显不足,在EPR模型生成的结果中,不同访问频率的聚类的访问地点组成几乎相同,且具有较高访问频率的访问地点有较高概率出现在低频访问地点聚类中。上述结果证明,本模型能够成功复现人群移动中访问地点和访问地点聚类之间的内在联系,相比于现有的EPR模型有显著提升。其次,通过在模型中引入访问地点聚类,本模型将个体的移动与城市中的物理空间建立了联系,在该模型的基础上,可以实现将人的移动行为与各类城市空间建立联系,有助于进一步模拟城市居民与各类城市空间的交互过程。In the test, this model was tested on two crowd movement trajectory datasets in Guangzhou and Houston, USA. When testing on the Guangzhou dataset, refer to the time length and sample size covered by the Guangzhou dataset. The number of individuals to generate movement trajectories is set to 17,000, and the target duration is set to 5 months. Based on the parameter fitting results of the Guangzhou data set, the relevant parameters in this model are set as follows: β=0.618, ρ=0.697, γ=0.199, α=0.615, τ=0.595, ω=-2.848, ε=0.578; When testing on the Houston dataset, referring to the time length and sample size covered by the Houston dataset, the number of individuals to generate movement trajectories is set to 17,000, and the target duration is set to 50 weeks. Based on the parameter fitting results of the Houston data set, the relevant parameters in this model are set as follows: β=0.800, ρ=0.647, γ=0.273, α=0.416, τ=0.606, ω=-3.572, ε=0.819. Based on the above tests, the relationship between the visiting locations and the visiting frequency rankings of the clusters they belong to in the human movement trajectories of the two cities can be obtained respectively. By comparing the data sets and the human movement trajectories generated by this model, the visiting locations and the clusters they belong to It can be found that the relationship between the visiting locations generated by this model and the visiting frequency rankings of the clusters they belong to is consistent with the characteristics presented by the data set: that is, the visits belonging to the clusters of high-frequency visiting locations Places have higher visit frequency, and visit places with higher visit frequency are more likely to appear in clusters of high-frequency visit places. However, the existing EPR model has obvious deficiencies in describing the relationship between visiting locations and the ranking of visiting frequencies of the clusters they belong to. In the results generated by the EPR model, the composition of visiting locations of clusters with different visiting frequencies is almost the same, and Visit places with higher visit frequencies have a higher probability of appearing in the low-frequency visit place cluster. The above results prove that this model can successfully reproduce the intrinsic relationship between visiting locations and visiting location clusters in crowd movement, which is significantly improved compared with the existing EPR model. Secondly, by introducing clustering of visiting locations into the model, this model establishes a connection between individual movement and the physical space in the city. It is helpful to further simulate the interaction process between urban residents and various urban spaces.

根据本申请实施例提出的基于访问地点聚类的人群移动建模方法,通过从个体移动轨迹中识别访问地点聚类,利用个体在访问地点聚类层面的移动行为得到人群移动特征,除停留时间、移动步长、个体的探索概率随访问地点数量的变化情况、个体的随机探索概率随访问地点数量的变化情况以外,本申请还创新性地考虑了个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布,通过考虑该条件概率分布,本申请能够成功复现人群移动中访问地点和访问地点聚类之间的内在联系,刻画个体移动的重要特征,填补现有模型的不足。进一步引入了个体移动在访问地点聚类视角下的两阶段决策过程,不仅能够预测个体的移动轨迹,还能够预测个体在访问地点聚类之间和访问地点聚类内部的访问模式,复现访问地点和访问地点聚类之间的内在联系,填补了现有模型的缺陷。According to the crowd movement modeling method based on visiting place clustering proposed in the embodiment of the present application, by identifying the visiting place clusters from the individual movement trajectories, the movement behavior of the individual at the visiting place clustering level is used to obtain the crowd movement characteristics, except for the stay time , moving step size, the variation of the individual’s exploration probability with the number of visited locations, and the variation of the individual’s random exploration probability with the number of visited locations, this application also innovatively considers the ordering of the individual’s visit frequencies of all visited locations and their The conditional probability distribution between the visit frequency rankings of the visiting place clusters, by considering the conditional probability distribution, this application can successfully reproduce the internal relationship between the visiting places and visiting place clusters in the crowd movement, and describe the important characteristics of individual movement , to fill in the gaps of existing models. It further introduces a two-stage decision-making process of individual movement from the perspective of visiting location clusters, which can not only predict the individual's movement trajectory, but also predict the individual's access patterns between visiting location clusters and within visiting location clusters, recurring visits Intrinsically linking clusters of places and visited places fills the gaps of existing models.

其次参照附图描述根据本申请实施例提出的基于访问地点聚类的人群移动建模装置。Next, the apparatus for modeling crowd movement based on clustering of visiting locations proposed according to the embodiments of the present application will be described with reference to the accompanying drawings.

图6为根据本申请实施例的基于访问地点聚类的人群移动建模装置的示例图。Fig. 6 is an example diagram of a crowd movement modeling device based on visiting place clustering according to an embodiment of the present application.

如图6所示,该基于访问地点聚类的人群移动建模装置10包括:获取模块100、参数生成模块200和轨迹生成模块300。As shown in FIG. 6 , the apparatus 10 for modeling crowd movement based on clustering of visited locations includes: an acquisition module 100 , a parameter generation module 200 and a trajectory generation module 300 .

其中,获取模块100,用于基于历史人群移动轨迹数据获取人群中多个个体的访问地点和访问地点聚类。Wherein, the acquisition module 100 is configured to acquire the visit locations and visit location clusters of multiple individuals in the crowd based on historical crowd movement trajectory data.

参数生成模块200,用于根据访问地点、访问地点聚类和历史人群移动轨迹提取人群移动特征,并根据人群移动特征生成人群移动模型的模型参数,其中,人群移动特征包括个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布。The parameter generation module 200 is used to extract crowd movement features according to the visiting locations, clustering of visiting spots, and historical crowd movement trajectories, and generate model parameters of the crowd movement model according to the crowd movement features, wherein the crowd movement features include all visit locations of individuals The conditional probability distribution between the visit frequency rankings and the visit frequency rankings of the visit location clusters to which they belong.

轨迹生成模块300,用于将设定的模拟个体数量和目标时长输入人群移动模型,通过人群移动模型生成目标时长内多个模拟个体的移动轨迹。The trajectory generation module 300 is configured to input the set number of simulated individuals and the target duration into the crowd movement model, and generate movement trajectories of multiple simulated individuals within the target duration through the crowd movement model.

可选地,在本申请的一个实施例中,获取模块100包括:识别单元,用于根据历史人群移动轨迹数据确定个体轨迹中的多个位置记录点,计算多个位置记录点的当前速度,将当前速度小于等于预设速度阈值的位置记录点标记为停留位置记录点;第一聚类单元,用于对所有停留位置记录点进行聚类,计算个体在每个位置记录点聚类中的单次停留时间,根据单次停留时间确定个体的访问地点,其中,访问地点的坐标为对应位置记录点聚类的中心;第二聚类单元,用于对个体的所有访问地点进行聚类,得到访问地点聚类。Optionally, in one embodiment of the present application, the acquisition module 100 includes: an identification unit, configured to determine multiple position record points in the individual trajectory according to historical crowd movement track data, and calculate the current speed of the multiple position record points, Mark the position record points whose current speed is less than or equal to the preset speed threshold as the stay position record points; the first clustering unit is used to cluster all the stay position record points, and calculate the individual's position in each position record point cluster A single stay time, according to a single stay time to determine the individual's visit location, where the coordinates of the visit location is the center of the clustering of the corresponding location record points; the second clustering unit is used to cluster all the visit locations of the individual, Get clustering of visiting locations.

可选地,在本申请的一个实施例中,参数生成模块200包括:第一提取单元,用于根据个体对各个访问地点的访问顺序及每次访问的到达时间、离开时间,计算每个个体每次访问的停留时间,并统计所有个体的停留时间分布;第二提取单元,用于根据个体对各访问地点的访问顺序,计算每个个体各次访问之间的移动步长,并统计所有个体的移动步长分布;第三提取单元,用于根据个体的历史移动轨迹,统计在不同访问地点数量下个体的探索概率;第四提取单元,用于根据个体的历史移动轨迹,统计在不同访问地点数量下个体的随机探索概率;第五提取单元,用于将个体的所有访问地点和所有访问地点聚类的访问频率进行排序,计算个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布。Optionally, in one embodiment of the present application, the parameter generation module 200 includes: a first extraction unit, configured to calculate the The residence time of each visit, and count the distribution of the residence time of all individuals; the second extraction unit is used to calculate the moving step size between each visit of each individual according to the order of visits by individuals to each visit location, and count all The individual’s movement step distribution; the third extraction unit is used to count the exploration probability of the individual under different numbers of visited locations according to the individual’s historical movement trajectory; the fourth extraction unit is used to calculate the individual’s historical movement trajectory in different locations. The random exploration probability of the individual under the number of visiting locations; the fifth extraction unit is used to sort all the visiting locations of the individual and the visiting frequencies of all the visiting locations clustered, and calculate the visiting frequency ranking of all the visiting locations of the individual and the visiting locations to which they belong The conditional probability distribution among the visit frequency orderings of the clusters.

可选地,在本申请的一个实施例中,轨迹生成模块300包括:初始单元,用于选定每个模拟个体的初始位置,并设定初始时刻为0;探索单元,用于根据人群移动特征提取模拟个体的停留时间,并根据模拟个体的探索概率进行探索行为或返回行为;探索单元,还用于在模拟个体为探索行为时,根据模拟个体的随机探索概率进行随机探索行为或聚类内探索行为;探索单元,还用于在模拟个体进行随机探索行为时,根据人群移动特征提取移动步长,并在移动步长范围内,确定模拟个体在访问地点聚类以外的任一探索地点并进行访问;探索单元,还用于在模拟个体进行聚类内探索行为时,根据访问地点聚类的访问频率排序确定模拟个体的待访问地点聚类,根据人群移动特征提取移动步长,并在移动步长范围内,确定模拟个体在所述待访问地点聚类内的任一探索地点并进行访问;探索单元,还用于在模拟个体为返回行为时,根据访问地点聚类的访问频率确定模拟个体的待访问地点聚类,根据待访问地点聚类内的访问地点的访问频率确定模拟个体的待访问地点并进行访问;输出单元,用于计算模拟个体在多个访问地点的总停留时间是否大于等于目标时长,若总停留时间大于等于目标时长,则根据多个访问地点的位置和访问顺序生成模拟个体的移动轨迹,若总停留时间小于目标时长,则继续根据模拟个体的探索概率进行探索行为或返回行为,直至总停留时间大于等于目标时长。Optionally, in one embodiment of the present application, the trajectory generation module 300 includes: an initial unit, used to select the initial position of each simulated individual, and set the initial time to 0; an exploration unit, used to move according to the crowd Feature extraction simulates the individual's stay time, and performs exploration behavior or return behavior according to the exploration probability of the simulated individual; the exploration unit is also used to perform random exploration behavior or clustering according to the random exploration probability of the simulated individual when the simulated individual is an exploration behavior Inner exploration behavior; the exploration unit is also used to extract the moving step according to the movement characteristics of the crowd when the simulated individual performs random exploration behavior, and within the range of the moving step, determine any exploration location of the simulated individual outside the cluster of visiting locations and conduct visits; the exploration unit is also used to determine the location clusters to be visited of the simulated individuals according to the visit frequency ranking of the visited location clusters when the simulated individuals perform cluster exploration behavior, and extract the moving step size according to the movement characteristics of the crowd, and Within the range of the moving step, determine and visit any exploration site within the cluster of locations to be visited by the simulated individual; the exploration unit is also used to cluster the visit frequency according to the visited site when the simulated individual is returning behavior Determine the cluster of places to be visited by the simulated individual, determine and visit the places to be visited by the simulated individual according to the visit frequency of the places to be visited in the cluster of places to be visited; the output unit is used to calculate the total stay of the simulated individual at multiple visited places Whether the time is greater than or equal to the target duration. If the total stay time is greater than or equal to the target duration, the movement trajectory of the simulated individual will be generated according to the location and visit sequence of multiple visit locations. If the total stay time is less than the target duration, continue to be based on the exploration probability of the simulated individual. Perform exploration behavior or return behavior until the total stay time is greater than or equal to the target duration.

需要说明的是,前述对基于访问地点聚类的人群移动建模方法实施例的解释说明也适用于该实施例的基于访问地点聚类的人群移动建模装置,此处不再赘述。It should be noted that, the foregoing explanations on the embodiment of the method for modeling crowd movement based on clustering of visited places are also applicable to the apparatus for modeling crowd movement based on clustering of visited places in this embodiment, and details are not repeated here.

根据本申请实施例提出的基于访问地点聚类的人群移动建模装置,通过从个体移动轨迹中识别访问地点聚类,利用个体在访问地点聚类层面的移动行为得到人群移动特征,除停留时间、移动步长、个体的探索概率随访问地点数量的变化情况、个体的随机探索概率随访问地点数量的变化情况以外,本申请还创新性地考虑了个体的所有访问地点的访问频率排序及其所属访问地点聚类的访问频率排序间的条件概率分布,通过考虑该条件概率分布,本申请能够成功复现人群移动中访问地点和访问地点聚类之间的内在联系,刻画个体移动的重要特征,填补现有模型的不足。进一步引入了个体移动在访问地点聚类视角下的两阶段决策过程,不仅能够预测个体的移动轨迹,还能够预测个体在访问地点聚类之间和访问地点聚类内部的访问模式,复现访问地点和访问地点聚类之间的内在联系,填补了现有模型的缺陷。According to the crowd movement modeling device based on visiting place clustering proposed in the embodiment of the present application, by identifying visiting place clusters from individual movement trajectories, the movement behavior of individuals at the visiting place clustering level is used to obtain crowd movement characteristics, except for the stay time , moving step size, the variation of the individual’s exploration probability with the number of visited locations, and the variation of the individual’s random exploration probability with the number of visited locations, this application also innovatively considers the ordering of the individual’s visit frequencies of all visited locations and their The conditional probability distribution between the visit frequency rankings of the visiting place clusters, by considering the conditional probability distribution, this application can successfully reproduce the internal relationship between the visiting places and visiting place clusters in the crowd movement, and describe the important characteristics of individual movement , to fill in the gaps of existing models. It further introduces a two-stage decision-making process of individual movement from the perspective of visiting location clusters, which can not only predict the individual's movement trajectory, but also predict the individual's access patterns between visiting location clusters and within visiting location clusters, recurring visits Intrinsically linking clusters of places and visited places fills the gaps of existing models.

在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或N个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, descriptions referring to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structure, material or characteristic is included in at least one embodiment or example of the present application. In this specification, the schematic representations of the above terms are not necessarily directed to the same embodiment or example. Moreover, the described specific features, structures, materials or characteristics may be combined in any one or N embodiments or examples in an appropriate manner. In addition, those skilled in the art can combine and combine different embodiments or examples and features of different embodiments or examples described in this specification without conflicting with each other.

此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本申请的描述中,“N个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, the features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In the description of the present application, "N" means at least two, such as two, three, etc., unless otherwise specifically defined.

流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更N个用于实现定制逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method description in a flowchart or otherwise described herein may be understood to represent a module, segment or portion of code comprising one or more executable instructions for implementing a custom logical function or step of a process , and the scope of preferred embodiments of the present application includes additional implementations in which functions may be performed out of the order shown or discussed, including in substantially simultaneous fashion or in reverse order depending on the functions involved, which shall It should be understood by those skilled in the art to which the embodiments of the present application belong.

Claims (8)

1. The crowd movement modeling method based on the access place clustering is characterized by comprising the following steps of:
acquiring access sites and access site clusters of a plurality of individuals in a crowd based on historical crowd movement track data;
extracting crowd movement features according to the access places, the access place clusters and the historical crowd movement tracks, and generating model parameters of a crowd movement model according to the crowd movement features, wherein the crowd movement features comprise access frequency sequences of all access places of individuals and conditional probability distribution among access frequency sequences of the access place clusters to which the individuals belong;
Inputting the set number of simulated individuals and the target duration into the crowd movement model, and generating movement tracks of a plurality of simulated individuals in the target duration through the crowd movement model.
2. The method of claim 1, wherein the obtaining access locations and access location clusters for a plurality of individuals in the crowd based on the historical crowd movement trajectory data comprises:
determining a plurality of position record points in an individual track according to the historical crowd movement track data, calculating the current speeds of the position record points, and marking the position record points with the current speeds less than or equal to a preset speed threshold as stay position record points;
clustering all the stay position record points, calculating single stay time of an individual in each position record point cluster, and determining access places of the individual according to the single stay time, wherein coordinates of the access places are centers of the corresponding position record point clusters;
clustering all access places of the individuals to obtain the access place clusters.
3. The method of claim 1, wherein extracting crowd movement features from the visit location, the visit location cluster, and the historical crowd movement trajectories comprises:
Calculating the residence time of each individual visit according to the visit sequence of each individual visit to each visit place, the arrival time and the departure time of each visit, and counting the residence time distribution of all individuals;
calculating the moving step length between each visit of each individual according to the visit sequence of each individual to each visit place, and counting the moving step length distribution of all the individuals;
counting the exploration probability of the individual under different access places according to the historical movement track of the individual;
according to the historical movement track of the individual, counting random exploration probabilities of the individual under different access places;
sorting all access places of the individual and the access frequencies of all access place clusters, and calculating the access frequency sorting of all access places of the individual and the conditional probability distribution among the access frequency sorting of the access place clusters to which the access places belong.
4. A method according to any one of claims 1-3, wherein generating movement trajectories for a plurality of simulated individuals within the target time period from the crowd movement model comprises:
selecting an initial position of each simulation individual, and setting the initial time to be 0;
extracting the residence time of the simulated individuals according to the crowd movement characteristics, and carrying out exploration behaviors or return behaviors according to the exploration probability of the simulated individuals;
When the simulated individual is the exploration behavior, carrying out random exploration behavior or intra-cluster exploration behavior according to the random exploration probability of the simulated individual;
when the simulated individuals conduct random exploration behaviors, extracting movement step length according to the crowd movement characteristics, and determining any exploration place of the simulated individuals outside the visit place cluster in the movement step length range for visit;
when the simulated individuals perform intra-cluster exploration behaviors, determining a to-be-visited place cluster of the simulated individuals according to the visit frequency sequence of the visit place cluster, extracting the moving step length according to the crowd moving characteristics, and determining any exploration place of the simulated individuals in the to-be-visited place cluster and visiting the exploration place within the moving step length range;
when the simulated individual is in a return behavior, determining a to-be-accessed place cluster of the simulated individual according to the access frequency of the access place cluster, and determining and accessing the to-be-accessed place of the simulated individual according to the access frequency of the access place in the to-be-accessed place cluster;
calculating whether the total residence time of the simulation individual in a plurality of access places is larger than or equal to the target duration, if the total residence time is larger than or equal to the target duration, generating a moving track of the simulation individual according to the positions and the access sequence of the plurality of access places, and if the total residence time is smaller than the target duration, continuing to search or return according to the exploration probability of the simulation individual until the total residence time is larger than or equal to the target duration.
5. A crowd movement modeling apparatus based on access location clustering, comprising:
the acquisition module is used for acquiring access sites and access site clusters of a plurality of individuals in the crowd based on the historical crowd movement track data;
the parameter generation module is used for extracting crowd movement characteristics according to the access places, the access place clusters and the historical crowd movement tracks, and generating model parameters of a crowd movement model according to the crowd movement characteristics, wherein the crowd movement characteristics comprise access frequency sequences of all access places of individuals and conditional probability distribution among the access frequency sequences of the access place clusters to which the individual access places belong;
the track generation module is used for inputting the set number of the simulated individuals and the target time length into the crowd movement model, and generating movement tracks of the plurality of the simulated individuals in the target time length through the crowd movement model.
6. The apparatus of claim 5, wherein the acquisition module comprises:
the identification unit is used for determining a plurality of position record points in the individual track according to the historical crowd moving track data, calculating the current speed of the position record points, and marking the position record points with the current speed less than or equal to a preset speed threshold as stay position record points;
The first clustering unit is used for clustering all the stay position record points, calculating single stay time of an individual in each position record point cluster, and determining access places of the individual according to the single stay time, wherein the coordinates of the access places are the centers of the corresponding position record point clusters;
and the second clustering unit is used for clustering all the access places of the individuals to obtain the access place clusters.
7. The apparatus of claim 5, wherein the parameter generation module comprises:
the first extraction unit is used for calculating the stay time of each access of each individual according to the access sequence of the individual to each access place, the arrival time and the departure time of each access, and counting the stay time distribution of all the individuals;
the second extraction unit is used for calculating the moving step length between each visit of each individual according to the visit sequence of the individual to each visit place and counting the moving step length distribution of all the individuals;
the third extraction unit is used for counting the exploration probability of the individuals under different access places according to the historical movement track of the individuals;
a fourth extraction unit, configured to count random exploration probabilities of individuals under different access location numbers according to historical movement tracks of the individuals;
And a fifth extraction unit, configured to rank all access sites of the individual and access frequencies of all access site clusters, and calculate a conditional probability distribution between the access frequency ranks of all access sites of the individual and the access frequency ranks of the access site clusters to which the access sites belong.
8. The apparatus of any of claims 5-7, wherein the trajectory generation module comprises:
the initial unit is used for selecting the initial position of each simulation individual and setting the initial time to be 0;
the exploration unit is used for extracting the residence time of the analog individuals according to the crowd movement characteristics and carrying out exploration behaviors or return behaviors according to the exploration probability of the analog individuals;
the exploration unit is further used for conducting random exploration behaviors or intra-cluster exploration behaviors according to random exploration probabilities of the simulated individuals when the simulated individuals are exploration behaviors;
the exploration unit is further used for extracting a moving step length according to the crowd moving characteristics when the simulated individuals conduct random exploration behaviors, and determining any exploration place of the simulated individuals outside the visit place cluster and visiting the exploration place within the moving step length range;
The exploration unit is further used for determining a to-be-visited place cluster of the analog individual according to the visit frequency sequence of the visit place cluster when the analog individual performs intra-cluster exploration behaviors, extracting the moving step length according to the crowd moving characteristics, and determining any exploration place of the analog individual in the to-be-visited place cluster and visiting the exploration place within the moving step length range;
the exploration unit is further used for determining a to-be-accessed place cluster of the simulated individual according to the access frequency of the access place cluster when the simulated individual is in return behavior, and determining and accessing the to-be-accessed place of the simulated individual according to the access frequency of the access place in the to-be-accessed place cluster;
the output unit is used for calculating whether the total residence time of the simulation individual at a plurality of access places is greater than or equal to the target duration, if the total residence time is greater than or equal to the target duration, generating a moving track of the simulation individual according to the positions and the access sequence of the plurality of access places, and if the total residence time is less than the target duration, continuing to perform exploration or return according to the exploration probability of the simulation individual until the total residence time is greater than or equal to the target duration.
CN202310281628.2A 2023-03-22 2023-03-22 Crowd movement modeling method and device based on visiting location clustering Active CN115994313B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310281628.2A CN115994313B (en) 2023-03-22 2023-03-22 Crowd movement modeling method and device based on visiting location clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310281628.2A CN115994313B (en) 2023-03-22 2023-03-22 Crowd movement modeling method and device based on visiting location clustering

Publications (2)

Publication Number Publication Date
CN115994313A true CN115994313A (en) 2023-04-21
CN115994313B CN115994313B (en) 2023-05-30

Family

ID=85993686

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310281628.2A Active CN115994313B (en) 2023-03-22 2023-03-22 Crowd movement modeling method and device based on visiting location clustering

Country Status (1)

Country Link
CN (1) CN115994313B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407519A (en) * 2016-08-31 2017-02-15 浙江大学 Modeling method for crowd moving rule
CN106951903A (en) * 2016-10-31 2017-07-14 浙江大学 A Visualization Method of Crowd Movement Law
US20190122229A1 (en) * 2017-10-19 2019-04-25 International Business Machines Corporation Recognizing recurrent crowd mobility patterns
CN113486927A (en) * 2021-06-15 2021-10-08 北京大学 Unsupervised track access place labeling method based on prior probability
CN113505314A (en) * 2021-07-27 2021-10-15 王程 Position track analysis system for space-time complex network clustering
CN114580539A (en) * 2022-03-04 2022-06-03 京东鲲鹏(江苏)科技有限公司 A vehicle driving strategy processing method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407519A (en) * 2016-08-31 2017-02-15 浙江大学 Modeling method for crowd moving rule
CN106951903A (en) * 2016-10-31 2017-07-14 浙江大学 A Visualization Method of Crowd Movement Law
US20190122229A1 (en) * 2017-10-19 2019-04-25 International Business Machines Corporation Recognizing recurrent crowd mobility patterns
CN113486927A (en) * 2021-06-15 2021-10-08 北京大学 Unsupervised track access place labeling method based on prior probability
CN113505314A (en) * 2021-07-27 2021-10-15 王程 Position track analysis system for space-time complex network clustering
CN114580539A (en) * 2022-03-04 2022-06-03 京东鲲鹏(江苏)科技有限公司 A vehicle driving strategy processing method and device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XIAOLU XIA等: "Human motion prediction for intelligent construction: A review", 《AUTOMATION IN CONSTRUCTION》 *
李帆等: "顾及停留位置特征提取的个人位置预测方法", 《武汉大学学报(信息科学版)》 *

Also Published As

Publication number Publication date
CN115994313B (en) 2023-05-30

Similar Documents

Publication Publication Date Title
Yin et al. Joint modeling of user check-in behaviors for real-time point-of-interest recommendation
Yang et al. Predicting next location using a variable order Markov model
Jiao et al. A novel next new point-of-interest recommendation system based on simulated user travel decision-making process
Piovani et al. Urban retail location: Insights from percolation theory and spatial interaction modeling
CN108829761A (en) A kind of point of interest recommended method, system, medium and equipment
CN113158038A (en) Interest point recommendation method and system based on STA-TCN neural network framework
CN116437291B (en) A cultural circle planning method and system based on mobile phone signaling
CN109948066A (en) A Point-of-Interest Recommendation Method Based on Heterogeneous Information Network
KR102223640B1 (en) Cloud-based personalized contents subscription service providing system and method thereof
CN109165691A (en) Training method, device and the electronic equipment of the model of cheating user for identification
Xing et al. Flow trace: A novel representation of intra-urban movement dynamics
Kavak et al. Location-based social simulation
CN116598014A (en) Medical missing data complement method based on graph attention mechanism and language big model
Gong et al. Single cell lineage reconstruction using distance-based algorithms and the R package, DCLEAR
Elgar et al. Simulations of firm location decisions: Replicating office location choices in the Greater Toronto Area
Geertman et al. Spatial‐temporal specific neighbourhood rules for cellular automata land‐use modelling
CN113255951B (en) A method and system for generating a moving trajectory
KR102615650B1 (en) Crowd movement prediction method based on generative adversarial network
Keskin et al. Cohort fertility heterogeneity during the fertility decline period in Turkey
CN115994313B (en) Crowd movement modeling method and device based on visiting location clustering
Efiong GIS-based network analysis for optimisation of public facilities closure: A study on libraries in Leicestershire, United Kingdom
Malleson et al. Place-based simulation modeling: agent-based modeling and virtual environments
Pugliese et al. Understanding human mobility dynamics: Insights from summarized semantic trajectories
Jaegal et al. Measuring the structural similarity of network time prisms using temporal signatures with graph indices
Li Joint Modeling of User Behaviors Based on Variable‐Order Additive Markov Chain for POI Recommendation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant