|International Journal of Geographical Information Science
|Jiang Jincheng,Li Qinqquan*,Tu Wei,Shaw Shi-lung,Yue Yang
Sampling fraction is crucial to sampling-related studies and applications, especially in the big data era when most data are neither originally designed nor controllable in the data collection process. A common concern among researchers is ‘what’s the modelling accuracy when using a sample?’. Taking intra-city human mobility as the study objective, this study utilizes a simple and direct method to analyse the influences of various sampling fractions on modelling accuracy. Five common intra-city human mobility indicators (travel distance, travel time, travel frequency, radius of gyration and movement entropy) are evaluated considering mean value, median and probability distribution. Experimental results demonstrate that the representativeness of each considered indicator converges to 1 in its own unique rate and variances. The minimum required sampling fractions to satisfy specific accuracies differ for various indicators and evaluation measures. To further investigate how related factors influence the modelling accuracy of sampling fractions, additional experiments are conducted considering multiple sampling methods, study scopes, and data sources. Several interesting general findings are observed. This study provides a reference for other sampling-based applications.