CN118433649A - Permanent population identification method based on mobile phone signaling data - Google Patents
Permanent population identification method based on mobile phone signaling data Download PDFInfo
- Publication number
- CN118433649A CN118433649A CN202410611548.3A CN202410611548A CN118433649A CN 118433649 A CN118433649 A CN 118433649A CN 202410611548 A CN202410611548 A CN 202410611548A CN 118433649 A CN118433649 A CN 118433649A
- Authority
- CN
- China
- Prior art keywords
- mobile phone
- user
- signaling data
- permanent
- day
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000011664 signaling Effects 0.000 title claims abstract description 94
- 238000000034 method Methods 0.000 title claims abstract description 41
- 238000007781 pre-processing Methods 0.000 claims abstract description 11
- 230000000694 effects Effects 0.000 claims description 20
- 230000002159 abnormal effect Effects 0.000 claims description 4
- 238000010220 Pearson correlation analysis Methods 0.000 claims description 3
- 230000003203 everyday effect Effects 0.000 claims 1
- 238000010586 diagram Methods 0.000 description 4
- 101100233916 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR5 gene Proteins 0.000 description 3
- 101001121408 Homo sapiens L-amino-acid oxidase Proteins 0.000 description 2
- 102100026388 L-amino-acid oxidase Human genes 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/20—Services signaling; Auxiliary data signalling, i.e. transmitting data via a non-traffic channel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/02—Services making use of location information
- H04W4/029—Location-based management or tracking services
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及手机信令数据分析与城市规划技术领域,具体涉及一种基于手机信令数据的常住人口识别方法。The present invention relates to the technical field of mobile phone signaling data analysis and urban planning, and in particular to a method for identifying permanent residents based on mobile phone signaling data.
背景技术Background technique
常住人口是指在某一地区居住超过六个月的人口,不仅可以反映该地区的人口数量和人口密度,也是研究社会经济发展,配置公共服务设施的重要依据,是进行城市规划和城市精细化治理的重要支撑数据。目前,获取城市常住人口数量的主要方式是通过每十年进行一次的人口普查。但是,进行人口普查的周期较长,每十年才进行一次,数据更新的速度远远落后城市发展,不能为城市规划和城市治理提供及时的人口数据。Permanent population refers to the population that has lived in a certain area for more than six months. It not only reflects the population size and population density of the area, but also is an important basis for studying social and economic development and allocating public service facilities. It is also an important supporting data for urban planning and refined urban governance. At present, the main way to obtain the number of permanent residents in a city is through a population census conducted every ten years. However, the cycle of conducting a population census is relatively long, and it is only conducted once every ten years. The speed of data update lags far behind urban development, and cannot provide timely population data for urban planning and urban governance.
随着信息技术的发展,手机信令数据得到了越来越广泛的使用。手机信令数具有较高的空间精度、丰富的时空信息、较全面的样本覆盖,可以记录用户行为,识别人口数量,为城市规划决策提供依据。但是由于购买和获取手机信令数据较昂贵,规划师往往只能获得一个月而非六个月或更长时间的手机信令数据,无法判断某个用户是否在该地区居住超过六个月。With the development of information technology, mobile phone signaling data has been used more and more widely. Mobile phone signaling data has high spatial accuracy, rich spatiotemporal information, and comprehensive sample coverage. It can record user behavior, identify population numbers, and provide a basis for urban planning decisions. However, since it is expensive to purchase and obtain mobile phone signaling data, planners can often only obtain mobile phone signaling data for one month instead of six months or longer, and cannot determine whether a user has lived in the area for more than six months.
因此,规划师目前只能采用简单设定阈值的方法将符合要求的手机用户识别为常住人口。这种方法科学依据不足,且准确率较低,进而难以倚靠手机信令数据来识别常住人口。Therefore, planners can only use a simple threshold setting method to identify mobile phone users who meet the requirements as permanent residents. This method lacks scientific basis and has a low accuracy rate, making it difficult to rely on mobile phone signaling data to identify permanent residents.
发明内容Summary of the invention
本发明是为了解决上述问题而进行的,目的在于提供一种基于手机信令数据的常住人口识别方法。The present invention is made to solve the above-mentioned problem, and aims to provide a method for identifying permanent residents based on mobile phone signaling data.
本发明提供了一种基于手机信令数据的常住人口识别方法,用于得到目标地区的常住人口数量,具有这样的特征,包括以下步骤:步骤S1,采集连续K天目标地区内各个用户的手机信令数据;步骤S2,对手机信令数据进行预处理,得到预处理手机信令数据;步骤S3,对各个用户,根据对应的预处理手机信令数据计算得到该用户对应的居住地;步骤S4,根据居住地和预处理手机信令数据,将对应的各个用户标记为稳定居住用户或非稳定居住用户;步骤S5,对各个稳定居住用户,根据对应的预处理手机信令数据和预设的单日驻留时间阈值M计算得到该稳定居住用户在K天内的有效停留天数;步骤S6,将有效停留天数大于等于预设的常住阈值N的所有稳定居住用户的总数作为常住人口数量,其中,稳定居住用户在目标地区的单日停留时长大于单日驻留时间阈值M时,当天为稳定居住用户的有效停留日,K天内所有有效停留日的总和为有效停留天数。The present invention provides a method for identifying permanent residents based on mobile phone signaling data, which is used to obtain the number of permanent residents in a target area and has the following characteristics: step S1, collecting mobile phone signaling data of each user in the target area for K consecutive days; step S2, preprocessing the mobile phone signaling data to obtain preprocessed mobile phone signaling data; step S3, for each user, calculating the corresponding place of residence of the user according to the corresponding preprocessed mobile phone signaling data; step S4, marking each corresponding user as a stable resident user or an unstable resident user according to the place of residence and the preprocessed mobile phone signaling data; step S5, for each stable resident user, calculating the effective number of days of stay of the stable resident user within K days according to the corresponding preprocessed mobile phone signaling data and a preset single-day residence time threshold M; step S6, taking the total number of all stable resident users whose effective number of days of stay is greater than or equal to the preset permanent threshold N as the number of permanent residents, wherein when the single-day residence time of the stable resident user in the target area is greater than the single-day residence time threshold M, the day is the effective residence day of the stable resident user, and the sum of all effective residence days within K days is the effective residence day.
在本发明提供的基于手机信令数据的常住人口识别方法中,还可以具有这样的特征:其中,手机信令数据包括年龄和性别,在步骤S2中,预处理包括剔除年龄或性别为异常值的手机信令数据。The method for identifying permanent residents based on mobile phone signaling data provided by the present invention may also have the following features: wherein the mobile phone signaling data includes age and gender, and in step S2, preprocessing includes eliminating mobile phone signaling data with abnormal age or gender.
在本发明提供的基于手机信令数据的常住人口识别方法中,还可以具有这样的特征:其中,步骤S3包括以下子步骤:步骤S3-1,根据用户的预处理手机信令数据,得到该用户在每天的指定时间段内在目标地区的各个活动地点和对应的停留时长;步骤S3-2,将K天内各个活动地点对应的停留时长累加,得到对应的停留时长总和;步骤S3-3,将停留时长总和最大的活动地点作为居住地。In the permanent resident population identification method based on mobile phone signaling data provided by the present invention, it can also have the following characteristics: wherein, step S3 includes the following sub-steps: step S3-1, based on the user's pre-processed mobile phone signaling data, obtaining the various activity locations and corresponding stay durations of the user in the target area within a specified time period of each day; step S3-2, accumulating the corresponding stay durations of each activity location within K days to obtain the corresponding total stay duration; step S3-3, taking the activity location with the largest total stay duration as the place of residence.
在本发明提供的基于手机信令数据的常住人口识别方法中,还可以具有这样的特征:其中,指定时间段为第一天晚上9点至第二天早上8点。The method for identifying permanent residents based on mobile phone signaling data provided by the present invention may also have the following feature: wherein the designated time period is from 9:00 p.m. on the first day to 8:00 a.m. on the second day.
在本发明提供的基于手机信令数据的常住人口识别方法中,还可以具有这样的特征:其中,在步骤S4中,判断用户是否满足判断条件,若是,则用户为稳定居住用户,若否,则用户为非稳定居住用户,判断条件的表达式为:式中a为在K天内用户出现在居住地的总天数,b为在K天内用户出现在目标地区的总天数,c为用户对应的居住地对应的停留时间总和,d为用户对应的所有活动地点对应的停留时间总和的总数,m和n均为预设的阈值。The method for identifying permanent residents based on mobile phone signaling data provided by the present invention may also have the following features: wherein, in step S4, it is determined whether the user meets the determination condition. If so, the user is a stable resident user; if not, the user is an unstable resident user. The determination condition is expressed as follows: Where a is the total number of days the user appears in his/her residence within K days, b is the total number of days the user appears in the target area within K days, c is the total length of stay at the user's corresponding residence, d is the total length of stay at all activity locations corresponding to the user, and m and n are both preset thresholds.
在本发明提供的基于手机信令数据的常住人口识别方法中,还可以具有这样的特征:其中,m=80,n=50。The method for identifying permanent residents based on mobile phone signaling data provided by the present invention may also have the following features: wherein m=80, n=50.
在本发明提供的基于手机信令数据的常住人口识别方法中,还可以具有这样的特征:其中,根据目标地区对应的现有的人口普查数据和对应的连续K天的历史手机信令数据,计算得到单日驻留时间阈值M和常住阈值N,包括以下步骤:步骤T1,设置初始单日驻留时间阈值、第一步长、最大时间阈值、初始常住阈值、第二步长和最大常住阈值;步骤T2,对历史手机信令数据进行预处理,得到各个历史用户的预处理历史手机信令数据;步骤T3,对各个历史用户,根据对应的预处理历史手机信令数据计算得到该历史用户对应的居住地;步骤T4,根据居住地和预处理历史手机信令数据,将对应的各个历史用户标记为历史稳定居住用户或历史非稳定居住用户;步骤T5,将初始单日驻留时间阈值作为调试单日驻留时间阈值,将初始常住阈值作为调试常住阈值;步骤T6,对各个历史稳定居住用户,将该历史稳定居住用户在目标地区的单日停留时长大于调试单日驻留时间阈值的当天作为历史有效停留日;步骤T7,对各个历史稳定居住用户,将所有对应的历史有效停留日的总数作为历史有效停留天数;步骤T8,将历史有效停留天数大于等于调试常住阈值的所有历史稳定居住用户的总数作为历史常住人口数量;步骤T9,将历史常住人口数量和人口普查数据进行Pearson相关性分析,得到相关性P;步骤T10,根据历史常住人口数量和人口普查数据,计算得到历史常住人口数量与人口普查数据中目标地区总人口数量的比值o;步骤T11,判断调试单日驻留时间阈值是否大于最大时间阈值,若是,则执行步骤T12,若否,则将调试单日驻留时间阈值与第一步长的和作为新的调试单日驻留时间阈值,并执行步骤T6;步骤T12,判断调试常住阈值是否大于最大常住阈值,若是,则执行步骤T13,若否,则将初始单日驻留时间阈值作为调试单日驻留时间阈值,将调试常住阈值与第二步长的和作为新的调试常住阈值,并执行步骤T6;步骤T13,判断最大的相关性P是否唯一,若是,则将最大的相关性P作为最优相关性P,若否,则将所有最大的相关性P中具有最大的比值o的相关性P作为最优相关性P;步骤T14,将最优相关性P对应的调试单日驻留时间阈值作为单日驻留时间阈值M,并将最优相关性P对应的调试常住阈值作为常住阈值N。The method for identifying permanent residents based on mobile phone signaling data provided by the present invention may also have the following features: wherein, according to the existing population census data corresponding to the target area and the corresponding historical mobile phone signaling data for K consecutive days, a single-day residence time threshold M and a permanent residence threshold N are calculated, comprising the following steps: step T1, setting an initial single-day residence time threshold, a first step length, a maximum time threshold, an initial permanent residence threshold, a second step length and a maximum permanent residence threshold; step T2, pre-processing the historical mobile phone signaling data to obtain pre-processed historical mobile phone signaling data of each historical user; step T3, for each historical user, calculating the historical user according to the corresponding pre-processed historical mobile phone signaling data The residence corresponding to the household; step T4, according to the residence and the pre-processed historical mobile phone signaling data, the corresponding historical users are marked as historical stable resident users or historical unstable resident users; step T5, the initial single-day residence time threshold is used as the debugging single-day residence time threshold, and the initial permanent residence threshold is used as the debugging permanent residence threshold; step T6, for each historical stable resident user, the day on which the single-day residence time of the historical stable resident user in the target area is greater than the debugging single-day residence time threshold is used as the historical effective residence day; step T7, for each historical stable resident user, the total number of all corresponding historical effective residence days is used as the historical effective residence days; step T8, the historical effective residence days The total number of all historical stable resident users greater than or equal to the debugging permanent threshold is taken as the historical permanent population; step T9, the historical permanent population and the census data are subjected to Pearson correlation analysis to obtain the correlation P; step T10, based on the historical permanent population and the census data, the ratio o of the historical permanent population to the total population of the target area in the census data is calculated; step T11, whether the debugging single-day residence time threshold is greater than the maximum time threshold, if so, execute step T12, if not, the sum of the debugging single-day residence time threshold and the first step length is taken as the new debugging single-day residence time threshold, and execute step T6; step T12, determine Whether the debugging permanent threshold is greater than the maximum permanent threshold, if so, execute step T13, if not, use the initial single-day residence time threshold as the debugging single-day residence time threshold, use the sum of the debugging permanent threshold and the second step length as the new debugging permanent threshold, and execute step T6; step T13, determine whether the maximum correlation P is unique, if so, use the maximum correlation P as the optimal correlation P, if not, use the correlation P with the largest ratio o among all the maximum correlations P as the optimal correlation P; step T14, use the debugging single-day residence time threshold corresponding to the optimal correlation P as the single-day residence time threshold M, and use the debugging permanent threshold corresponding to the optimal correlation P as the permanent threshold N.
在本发明提供的基于手机信令数据的常住人口识别方法中,还可以具有这样的特征:其中,初始单日驻留时间阈值为0,第一步长为3,最大时间阈值为12,初始常住阈值为10,第二步长为1,最大常住阈值为K。In the permanent resident population identification method based on mobile phone signaling data provided by the present invention, it can also have the following characteristics: wherein, the initial single-day residence time threshold is 0, the first step length is 3, the maximum time threshold is 12, the initial permanent threshold is 10, the second step length is 1, and the maximum permanent threshold is K.
发明的作用与效果Functions and Effects of the Invention
根据本发明所涉及的基于手机信令数据的常住人口识别方法,因为,一方面,通过手机信令数据和符合人日常作息需求的指定时间段计算得到各个用户的居住地,并根据各个用户在居住地的停留时间数据判定该用户是否为目标地区的稳定居住用户;另一方面,根据人口普查数据和对应的历史手机信令数据计算得到符合目标地区的单日驻留时间阈值M和常住阈值N,再根据该单日驻留时间阈值M和常住阈值N对稳定居住用户在该目标地区的活动数据进行分析,从而判断该稳定居住用户是否为该目标地区的常住人口。所以,本发明的基于手机信令数据的常住人口识别方法能够准确地得到目标地区的常住人口数量。According to the method for identifying permanent residents based on mobile phone signaling data involved in the present invention, on the one hand, the residence of each user is calculated through mobile phone signaling data and a designated time period that meets the daily work and rest needs of people, and whether the user is a stable resident user in the target area is determined based on the residence time data of each user in the residence; on the other hand, the single-day residence time threshold M and the permanent threshold N that meet the target area are calculated based on the census data and the corresponding historical mobile phone signaling data, and then the activity data of the stable resident user in the target area is analyzed based on the single-day residence time threshold M and the permanent threshold N, so as to determine whether the stable resident user is the permanent population of the target area. Therefore, the method for identifying permanent residents based on mobile phone signaling data of the present invention can accurately obtain the number of permanent residents in the target area.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本发明的实施例中基于手机信令数据的常住人口识别方法的流程示意图;1 is a schematic flow chart of a method for identifying permanent residents based on mobile phone signaling data in an embodiment of the present invention;
图2是本发明的实施例中根据预处理手机信令数据计算得到居住地的流程示意图;2 is a schematic diagram of a process for calculating a place of residence based on pre-processed mobile phone signaling data in an embodiment of the present invention;
图3是本发明的实施例中计算单日驻留时间阈值M和常住阈值N的流程示意图。FIG3 is a schematic diagram of a flow chart of calculating a single-day residence time threshold M and a permanent residence threshold N in an embodiment of the present invention.
具体实施方式Detailed ways
为了使本发明实现的技术手段、创作特征、达成目的与功效易于明白了解,以下实施例结合附图对本发明基于手机信令数据的常住人口识别方法作具体阐述。In order to make the technical means, creative features, objectives and effects achieved by the present invention easy to understand, the following embodiments and the accompanying drawings specifically illustrate the method for identifying permanent residents based on mobile phone signaling data of the present invention.
本实施例中提供一种基于手机信令数据的常住人口识别方法,用于得到目标地区的常住人口数量。This embodiment provides a method for identifying permanent residents based on mobile phone signaling data, which is used to obtain the number of permanent residents in a target area.
图1是本发明的实施例中基于手机信令数据的常住人口识别方法的流程示意图。FIG1 is a flow chart of a method for identifying permanent residents based on mobile phone signaling data in an embodiment of the present invention.
如图1所示,本实施例的基于手机信令数据的常住人口识别方法包括以下步骤:As shown in FIG1 , the method for identifying permanent residents based on mobile phone signaling data of this embodiment includes the following steps:
步骤S1,采集连续K天目标地区内各个用户的手机信令数据。现有的手机信令数据采购单位为月,即28-31天,本实施例中K=30。Step S1, collect the mobile phone signaling data of each user in the target area for K consecutive days. The existing mobile phone signaling data procurement unit is month, that is, 28-31 days, and in this embodiment, K=30.
步骤S2,对手机信令数据进行预处理,得到预处理手机信令数据。Step S2, preprocessing the mobile phone signaling data to obtain preprocessed mobile phone signaling data.
其中,手机信令数据包括年龄和性别。预处理包括剔除年龄或性别为异常值的手机信令数据,本实施例中异常值包括缺失值和不符合常规范围的数值。The mobile phone signaling data includes age and gender. The preprocessing includes removing the mobile phone signaling data with abnormal age or gender. In this embodiment, the abnormal values include missing values and values that do not conform to the normal range.
步骤S3,对各个用户,根据对应的预处理手机信令数据计算得到该用户对应的居住地。Step S3: For each user, the residence corresponding to the user is calculated based on the corresponding pre-processed mobile phone signaling data.
图2是本发明的实施例中根据预处理手机信令数据计算得到居住地的流程示意图。FIG. 2 is a schematic diagram of a flow chart of calculating a place of residence based on pre-processed mobile phone signaling data in an embodiment of the present invention.
如图2所示,步骤S3包括以下子步骤:As shown in FIG. 2 , step S3 includes the following sub-steps:
步骤S3-1,根据用户的预处理手机信令数据,得到该用户在每天的指定时间段内在目标地区的各个活动地点和对应的停留时长。其中,指定时间段根据人的日常休息时间设置得到,本实施例中指定时间段为第一天晚上9点至第二天早上8点,则用户在第一天晚上9点至第二天早上8点的时间段内在活动地点A的停留时间的总和,为该用户在活动地点A的停留时长。Step S3-1, based on the user's pre-processed mobile phone signaling data, obtain the various activity locations and corresponding stay durations of the user in the target area during the specified time period of each day. The specified time period is set according to the person's daily rest time. In this embodiment, the specified time period is from 9 pm on the first day to 8 am on the second day. The sum of the user's stay time at activity location A during the time period from 9 pm on the first day to 8 am on the second day is the stay duration of the user at activity location A.
步骤S3-2,将K天内各个活动地点对应的停留时长累加,得到对应的停留时长总和。Step S3-2, accumulating the corresponding stay durations at each activity location within K days to obtain the corresponding total stay duration.
步骤S3-3,将停留时长总和最大的活动地点作为居住地。Step S3-3, taking the activity location with the largest total stay time as the residence.
步骤S4,根据居住地和预处理手机信令数据,将对应的各个用户标记为稳定居住用户或非稳定居住用户。Step S4, marking each corresponding user as a stable resident user or an unstable resident user according to the place of residence and the pre-processed mobile phone signaling data.
其中,在步骤S4中,判断用户是否满足判断条件,若是,则用户为稳定居住用户,若否,则用户为非稳定居住用户。Among them, in step S4, it is determined whether the user meets the determination conditions. If so, the user is a stable resident user, and if not, the user is an unstable resident user.
判断条件的表达式为:The expression of the judgment condition is:
式中a为在K天内用户出现在居住地的总天数,b为在K天内用户出现在目标地区的总天数,c为用户对应的居住地对应的停留时间总和,d为用户对应的所有活动地点对应的停留时间总和的总数,m和n均为预设的阈值。本实施例中m=80,n=50。本实施例中停留时间总和c为用户在K天的各个指定时间段即第一天晚上9点至第二天早上8点的时间段内停留在居住地的时间总和。总数d为用户在K天的各个指定时间段即第一天晚上9点至第二天早上8点的时间段内停留在各个活动地点的时间的总和。Where a is the total number of days that the user appears in the residence within K days, b is the total number of days that the user appears in the target area within K days, c is the total stay time corresponding to the residence corresponding to the user, d is the total sum of the stay time corresponding to all activity locations corresponding to the user, and m and n are both preset thresholds. In this embodiment, m=80, n=50. In this embodiment, the total stay time c is the total time that the user stays in the residence during each specified time period of K days, that is, from 9 pm on the first day to 8 am on the second day. The total d is the total time that the user stays in each activity location during each specified time period of K days, that is, from 9 pm on the first day to 8 am on the second day.
步骤S5,对各个稳定居住用户,根据对应的预处理手机信令数据和预设的单日驻留时间阈值M计算得到该稳定居住用户在K天内的有效停留天数。Step S5, for each stable resident user, the effective stay days of the stable resident user within K days are calculated according to the corresponding pre-processed mobile phone signaling data and the preset single-day residence time threshold M.
其中,稳定居住用户在目标地区的单日停留时长大于单日驻留时间阈值M时,当天为稳定居住用户的有效停留日,K天内所有有效停留日的总和为有效停留天数。本实施例中单日驻留时间阈值M的单位为小时。本实施例中单日停留时长为稳定居住用户一天即全天内在目标地区的停留时间的总和。When the single-day residence time of a stable resident user in the target area is greater than the single-day residence time threshold M, the day is the effective residence day of the stable resident user, and the sum of all effective residence days within K days is the effective residence days. In this embodiment, the unit of the single-day residence time threshold M is hours. In this embodiment, the single-day residence time is the sum of the residence time of the stable resident user in the target area in one day, that is, the whole day.
步骤S6,将有效停留天数大于等于预设的常住阈值N的所有稳定居住用户的总数作为常住人口数量。本实施例中常住阈值N的单位为天。Step S6: The total number of all stable resident users whose effective stay days are greater than or equal to the preset permanent threshold N is taken as the permanent population. In this embodiment, the unit of the permanent threshold N is day.
本实施例的基于手机信令数据的常住人口识别方法通过设定的单日驻留时间阈值M和常住阈值N从手机信令数据中筛选得到常住人口,因此,单日驻留时间阈值M和常住阈值N的设定方式对常住人口识别准确率有较大的影响。The method for identifying permanent residents based on mobile phone signaling data of this embodiment obtains permanent residents from mobile phone signaling data by screening out permanent residents through the set single-day residence time threshold M and permanent residence threshold N. Therefore, the setting method of the single-day residence time threshold M and the permanent residence threshold N has a great influence on the accuracy of permanent resident identification.
本实施例中通过目标地区对应的现有的人口普查数据和该人口普查数据对应的连续K天的历史手机信令数据,计算得到单日驻留时间阈值M和常住阈值N,从而提高单日驻留时间阈值M和常住阈值N的设置合理性。In this embodiment, the single-day residence time threshold M and the permanent residence threshold N are calculated through the existing census data corresponding to the target area and the historical mobile phone signaling data of K consecutive days corresponding to the census data, thereby improving the rationality of setting the single-day residence time threshold M and the permanent residence threshold N.
图3是本发明的实施例中计算单日驻留时间阈值M和常住阈值N的流程示意图。FIG3 is a schematic diagram of a flow chart of calculating a single-day residence time threshold M and a permanent residence threshold N in an embodiment of the present invention.
如图3所示,计算得到单日驻留时间阈值M和常住阈值N包括以下步骤:As shown in FIG3 , calculating the single-day residence time threshold M and the permanent residence threshold N includes the following steps:
步骤T1,设置初始单日驻留时间阈值、第一步长、最大时间阈值、初始常住阈值、第二步长和最大常住阈值。Step T1, setting the initial single-day residence time threshold, the first step length, the maximum time threshold, the initial permanent threshold, the second step length and the maximum permanent threshold.
本实施例中初始单日驻留时间阈值为0,第一步长为3,最大时间阈值为12,初始常住阈值为10,第二步长为1,最大常住阈值为K。In this embodiment, the initial single-day residence time threshold is 0, the first step length is 3, the maximum time threshold is 12, the initial permanent threshold is 10, the second step length is 1, and the maximum permanent threshold is K.
步骤T2,对历史手机信令数据进行预处理,得到各个历史用户的预处理历史手机信令数据。Step T2, pre-processing the historical mobile phone signaling data to obtain the pre-processed historical mobile phone signaling data of each historical user.
步骤T3,对各个历史用户,根据对应的预处理历史手机信令数据计算得到该历史用户对应的居住地。Step T3: for each historical user, the residence corresponding to the historical user is calculated based on the corresponding pre-processed historical mobile phone signaling data.
步骤T4,根据居住地和预处理历史手机信令数据,将对应的各个历史用户标记为历史稳定居住用户或历史非稳定居住用户。Step T4, according to the place of residence and the pre-processed historical mobile phone signaling data, the corresponding historical users are marked as historical stable resident users or historical unstable resident users.
步骤T5,将初始单日驻留时间阈值作为调试单日驻留时间阈值,将初始常住阈值作为调试常住阈值。Step T5, using the initial single-day residence time threshold as the debugging single-day residence time threshold, and using the initial permanent threshold as the debugging permanent threshold.
步骤T6,对各个历史稳定居住用户,将该历史稳定居住用户在目标地区的单日停留时长大于调试单日驻留时间阈值的当天作为历史有效停留日。Step T6: for each historical stable resident user, the day on which the single-day stay duration of the historical stable resident user in the target area is greater than the debugged single-day stay time threshold is taken as a historical effective stay day.
步骤T7,对各个历史稳定居住用户,将所有对应的历史有效停留日的总数作为历史有效停留天数。Step T7: for each historical stable resident user, the total number of all corresponding historical valid stay days is taken as the historical valid stay days.
步骤T8,将历史有效停留天数大于等于调试常住阈值的所有历史稳定居住用户的总数作为历史常住人口数量。Step T8, taking the total number of all historical stable resident users whose historical effective stay days are greater than or equal to the debugged permanent resident threshold as the historical permanent resident population.
步骤T9,将历史常住人口数量和人口普查数据进行Pearson相关性分析,得到相关性P。Step T9, performing Pearson correlation analysis on the historical permanent population and the census data to obtain the correlation P.
步骤T10,根据历史常住人口数量和人口普查数据,计算得到历史常住人口数量与人口普查数据中目标地区总人口数量的比值o。Step T10, based on the historical permanent population and the census data, calculate the ratio o of the historical permanent population to the total population of the target area in the census data.
步骤T11,判断调试单日驻留时间阈值是否大于最大时间阈值,若是,则执行步骤T12,若否,则将调试单日驻留时间阈值与第一步长的和作为新的调试单日驻留时间阈值,并执行步骤T6。Step T11, determine whether the debugging single-day residence time threshold is greater than the maximum time threshold. If so, execute step T12. If not, take the sum of the debugging single-day residence time threshold and the first step as the new debugging single-day residence time threshold, and execute step T6.
步骤T12,判断调试常住阈值是否大于最大常住阈值,若是,则执行步骤T13,若否,则将初始单日驻留时间阈值作为调试单日驻留时间阈值,将调试常住阈值与第二步长的和作为新的调试常住阈值,并执行步骤T6。Step T12, determine whether the debugging permanent threshold is greater than the maximum permanent threshold. If so, execute step T13. If not, use the initial single-day residence time threshold as the debugging single-day residence time threshold, and use the sum of the debugging permanent threshold and the second step length as the new debugging permanent threshold, and execute step T6.
步骤T13,判断最大的相关性P是否唯一,若是,则将最大的相关性P作为最优相关性P,若否,则将所有最大的相关性P中具有最大的比值o的相关性P作为最优相关性P。Step T13, determine whether the maximum correlation P is unique, if so, take the maximum correlation P as the optimal correlation P, if not, take the correlation P with the largest ratio o among all the maximum correlations P as the optimal correlation P.
步骤T14,将最优相关性P对应的调试单日驻留时间阈值作为单日驻留时间阈值M,并将最优相关性P对应的调试常住阈值作为常住阈值N。Step T14: the debugging single-day residence time threshold corresponding to the optimal correlation P is used as the single-day residence time threshold M, and the debugging permanent threshold corresponding to the optimal correlation P is used as the permanent threshold N.
本实施例中,将上海市作为目标地区,根据上海市2020年11月开始调查得到的人口普查数据即上海市第七次全国人口普查数据,以及上海市2020年11月联通手机信令数据作为该上海市第七次全国人口普查数据对应的历史手机信令数据,通过上述单日驻留时间阈值M和常住阈值N的计算方法得到单日驻留时间阈值M的取值为0,常住阈值N的取值为22。In this embodiment, Shanghai is taken as the target area. According to the census data obtained from the survey in Shanghai in November 2020, namely the seventh national census data of Shanghai, and the China Unicom mobile phone signaling data in Shanghai in November 2020 are used as the historical mobile phone signaling data corresponding to the seventh national census data of Shanghai, the value of the single-day residence time threshold M and the permanent residence threshold N are calculated through the above-mentioned method. The value of the single-day residence time threshold M is 0, and the value of the permanent residence threshold N is 22.
本实施例中单日驻留时间阈值M的取值为0,且常住阈值N的取值为22时,采用本实施例的基于手机信令数据的常住人口识别方法计算得到的常住人口分布与对应的人口普查获取的常住人口分布相关性达到0.926,显著高于现有的简单设定阈值的方法获得的相关性0.918,该现有的简单设定阈值的方法为将一个月在目标地区出现天数超过10天的手机用户识别为常住人口。In this embodiment, when the value of the single-day residence time threshold M is 0 and the value of the permanent threshold N is 22, the correlation between the permanent population distribution calculated by the permanent population identification method based on mobile phone signaling data in this embodiment and the permanent population distribution obtained by the corresponding census reaches 0.926, which is significantly higher than the correlation of 0.918 obtained by the existing simple threshold setting method. The existing simple threshold setting method identifies mobile phone users who appear in the target area for more than 10 days in a month as permanent residents.
实施例的作用与效果Functions and Effects of the Embodiments
根据本实施例所涉及的基于手机信令数据的常住人口识别方法,一方面,通过手机信令数据和符合人日常作息需求的指定时间段计算得到各个用户的居住地,并根据各个用户在居住地的停留时间数据判定该用户是否为目标地区的稳定居住用户;另一方面,根据人口普查数据和对应的历史手机信令数据计算得到符合目标地区的单日驻留时间阈值M和常住阈值N,再根据该单日驻留时间阈值M和常住阈值N对稳定居住用户在该目标地区的活动数据进行分析,从而判断该稳定居住用户是否为该目标地区的常住人口。总之,本方法能够准确地得到目标地区的常住人口数量。According to the method for identifying permanent residents based on mobile phone signaling data involved in this embodiment, on the one hand, the residence of each user is calculated through mobile phone signaling data and a specified time period that meets the daily needs of people, and whether the user is a stable resident user in the target area is determined based on the residence time data of each user in the residence; on the other hand, the single-day residence time threshold M and the permanent threshold N that meet the target area are calculated based on the census data and the corresponding historical mobile phone signaling data, and then the activity data of the stable resident users in the target area are analyzed based on the single-day residence time threshold M and the permanent threshold N, so as to determine whether the stable resident users are the permanent residents of the target area. In short, this method can accurately obtain the number of permanent residents in the target area.
本行业的技术人员应该了解,本发明不受上述实施例的限制,上述实施例和说明书中描述的只是说明本发明的原理,在不脱离本发明精神和范围的前提下,本发明还会有各种变化和改进,这些变化和改进都落入要求保护的本发明范围内。本发明要求保护范围由所附的权利要求书及其等效物界定。Those skilled in the art should understand that the present invention is not limited to the above embodiments, and the above embodiments and descriptions are only for explaining the principles of the present invention. Without departing from the spirit and scope of the present invention, the present invention may have various changes and improvements, and these changes and improvements fall within the scope of the present invention to be protected. The scope of protection of the present invention is defined by the attached claims and their equivalents.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410611548.3A CN118433649A (en) | 2024-05-16 | 2024-05-16 | Permanent population identification method based on mobile phone signaling data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410611548.3A CN118433649A (en) | 2024-05-16 | 2024-05-16 | Permanent population identification method based on mobile phone signaling data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN118433649A true CN118433649A (en) | 2024-08-02 |
Family
ID=92315672
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202410611548.3A Pending CN118433649A (en) | 2024-05-16 | 2024-05-16 | Permanent population identification method based on mobile phone signaling data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN118433649A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118627027A (en) * | 2024-08-14 | 2024-09-10 | 中国测绘科学研究院 | A method, device and medium for identifying permanent population based on mobile phone signaling |
-
2024
- 2024-05-16 CN CN202410611548.3A patent/CN118433649A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118627027A (en) * | 2024-08-14 | 2024-09-10 | 中国测绘科学研究院 | A method, device and medium for identifying permanent population based on mobile phone signaling |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107134142B (en) | A kind of urban road method for predicting based on multisource data fusion | |
CN118433649A (en) | Permanent population identification method based on mobile phone signaling data | |
CN111680637A (en) | Mask detection method and detection system based on deep learning and image recognition technology | |
CN111639092A (en) | Personnel flow analysis method and device, electronic equipment and storage medium | |
CN110990443A (en) | Mobile phone signaling-based professional and living population characteristic estimation method | |
WO2014187304A1 (en) | User state determination method and device | |
CN111291216B (en) | Method and system for analyzing foothold based on face structured data | |
CN112579922B (en) | Interest point recommendation method based on user track | |
CN112001829A (en) | Population distribution judgment method based on mobile phone signaling data | |
CN110378002A (en) | Social relationships modeling method based on motion track | |
CN117039850B (en) | Abnormal power consumption analysis method and system based on spatiotemporal features | |
CN115034524A (en) | Method, system and storage medium for predicting working population based on mobile phone signaling | |
CN111784534A (en) | A method and system for predicting the operation state of an integrated energy metering system with multiple subsystems | |
KR101538231B1 (en) | Skin texture predicting method and apparatus | |
CN106846795A (en) | The acquisition methods and device in the crowd is dense region | |
CN110287232A (en) | A method for finding suspected surrogate card swiping based on time series correlation and behavioral probability weighting | |
CN115879016B (en) | Prediction method of tidal period for shared bicycle travel | |
CN113112069A (en) | Population distribution prediction method, population distribution prediction system and electronic equipment | |
CN118349797A (en) | A cloud data processing system for measuring stations and Beidou base stations | |
CN114302333B (en) | User identification method and device, electronic device and medium | |
CN116629637A (en) | Multi-feature fusion trip purpose estimation method and system based on LBS/AOI/POI data | |
CN115879594A (en) | Urban settlement population distribution trend prediction method based on geographic detector | |
CN115222018A (en) | Non-invasive load decomposition method and device | |
CN112488478A (en) | Method and device for identifying topology of low-voltage transformer area and storage medium | |
CN117858018A (en) | Method for estimating liveness based on local space-time density |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |