TY - JOUR
T1 - SafePath: Differentially-Private Publishing of Passenger Trajectories in Transportation Systems
T2 - Differentially-private publishing of passenger trajectories in transportation systems
AU - Al-Hussaeni, Khalil
AU - Fung, Benjamin C. M.
AU - Iqbal, Farkhund
AU - Dagher, Gaby G.
AU - Park, Eun G.
N1 - Al-Hussaeni, Khalil; Fung, Benjamin C.M.; Iqbal, Farkhund; Dagher, Gaby G.; and Park, Eun G.. (2018). "SafePath: Differentially-Private Publishing of Passenger Trajectories in Transportation Systems". Computer Networks, 143, 126-139.https://doi.org/10.1016/j.comnet.2018.07.007
PY - 2018/10/9
Y1 - 2018/10/9
N2 - In recent years, the collection of spatio-temporal data that captures human movements has increased tremendously due to the advancements in hardware and software systems capable of collecting person-specific data. The bulk of the data collected by these systems has numerous applications, or it can simply be used for general data analysis. Therefore, publishing such big data is greatly beneficial for data recipients. However, in its raw form, the collected data contains sensitive information pertaining to the individuals from which it was collected and must be anonymized before publication. In this paper, we study the problem of privacy-preserving passenger trajectories publishing and propose a solution under the rigorous differential privacy model. Unlike sequential data, which describes sequentiality between data items, handling spatio-temporal data is a challenging task due to the fact that introducing a temporal dimension results in extreme sparseness. Our proposed solution introduces an efficient algorithm, called SafePath , that models trajectories as a noisy prefix tree and publishes ϵ -differentially-private trajectories while minimizing the impact on data utility. Experimental evaluation on real-life transit data in Montreal suggests that SafePath significantly improves efficiency and scalability with respect to large and sparse datasets, while achieving comparable results to existing solutions in terms of the utility of the sanitized data.
AB - In recent years, the collection of spatio-temporal data that captures human movements has increased tremendously due to the advancements in hardware and software systems capable of collecting person-specific data. The bulk of the data collected by these systems has numerous applications, or it can simply be used for general data analysis. Therefore, publishing such big data is greatly beneficial for data recipients. However, in its raw form, the collected data contains sensitive information pertaining to the individuals from which it was collected and must be anonymized before publication. In this paper, we study the problem of privacy-preserving passenger trajectories publishing and propose a solution under the rigorous differential privacy model. Unlike sequential data, which describes sequentiality between data items, handling spatio-temporal data is a challenging task due to the fact that introducing a temporal dimension results in extreme sparseness. Our proposed solution introduces an efficient algorithm, called SafePath , that models trajectories as a noisy prefix tree and publishes ϵ -differentially-private trajectories while minimizing the impact on data utility. Experimental evaluation on real-life transit data in Montreal suggests that SafePath significantly improves efficiency and scalability with respect to large and sparse datasets, while achieving comparable results to existing solutions in terms of the utility of the sanitized data.
KW - differential privacy
KW - smart city
KW - sparse data
KW - trajectory data
KW - transportation
UR - https://scholarworks.boisestate.edu/cs_facpubs/159
UR - https://doi.org/10.1016/j.comnet.2018.07.007
UR - http://www.scopus.com/inward/record.url?scp=85049953091&partnerID=8YFLogxK
U2 - 10.1016/j.comnet.2018.07.007
DO - 10.1016/j.comnet.2018.07.007
M3 - Article
SN - 1389-1286
VL - 143
SP - 126
EP - 139
JO - Computer Networks
JF - Computer Networks
ER -