👋 I’m a second-year PhD student at UCL SpaceTimeLab, researching conversational systems (large language models) for complex routing problems, supervised by Dr James Haworth, Dr Aldo Lipani, and Dr Stefano Cavazzi. I am broadly interested in understanding how language models can be adapted for geospatial data, and how to adapt geospatial data for LLMs. My research is funded by UK Research and Innovation (UKRI/EPSRC) and the Ordnance Survey.
Recent publications
-
📄 Quantifying Geospatial in the Common Crawl Corpus (accepted to SIGSPATIAL’24)
-
📄 CC-GPX: Extracting High-Quality Annotated Geospatial Data from Common Crawl (accepted to SIGSPATIAL’24)
-
📄 Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text? (presented at GeoExT/ECIR 2024)
Teaching
At UCL, I’m a postgraduate teaching assistant for Geospatial Programming (CEGE0096), Spatial Analysis and Computation (CEGE0097), and Spatial-Temporal Data Analysis and Data Mining (CEGE0042) in 2023/24 academic year.
Experience
Prior to UCL, I spent five years in industry, first as a full-stack developer for an open data consultancy CTData Collaborative, and then as a data engineer for a location planning firm Geolytix.
I completed MSc in Geographic Information Science at the University of Leeds (UK), and BSc in Computer Science and Studio Arts at Trinity College in Connecticut (US). I spent one year of my undergraduate degree at the University of Oxford (Worcester College), where I focused on machine learning.
Achievements ☄️
Together with Jack Dougherty, I co-authored O’Reilly’s Hands-On Data Visualization: Interactive Storytelling from Spreadsheets to Code, which was translated from English into Korean and Traditional Chinese.