Measures

The GeoSmoking Study collected data across multiple domains to understand the relationship between tobacco retail environments and smoking behavior.

📊 Request Data Sharing 📁 Explore Interactive codebook 🗺️ View Tobacco Retailer Database

Data Access & Documentation

🔒 Data Access

  • De-identified datasets available for research
  • Data use agreements required
  • Secure data sharing protocols
  • Collaborative analysis opportunities

📄 Documentation

  • Complete variable codebook with descriptions
  • Data collection protocols and procedures
  • Quality control and validation methods
  • Missing data patterns and handling

Data Overview

📊 Recruitment

We screened over 4,000 potential participants recruited with the help of BuildClinical

310 Total Participants Enrolled

📍 Geospatial Data

GPS coordinates, retail environment mapping, and tobacco outlet exposure measures collected continuously throughout the 6-week study.

3.1M+ GPS coordinates collected

📱 EMA Data

Real-time ecological momentary assessments of craving, mood, and smoking behavior via smartphone app. Surveys were sent 4 times a day throughout the entire study period

67,000+ EMA responses collected

Data Collection Timeline

Baseline

2 Weeks

Initial Assessment -> 2 Week Data Collection Period

Demographics, smoking history, psychometric measures, location tracking setup

Intervention

4 Weeks

3 Conditions: Control, Tobacco Store Visits, Non-Tobacco Store Visits

EMA surveys, GPS tracking, and image rating tasks throughout study period

Follow-up

Post-Intervention

Optional Neuro-imaging study with additional survey measures


Data Categories

📋 Demographics & Screening

Participant characteristics, eligibility criteria, and baseline smoking behavior collected during initial screening and enrollment.

Key Variables:


📊 Psychometric Assessments

Validated survey instruments administered at multiple timepoints to assess smoking-related behaviors, cognitions, and individual differences.

Survey Measures

Instrument Categories:

Survey measures were collected over Qualtrics during each Online Study Session and during the optional fMRI study session.

Questions asked during online sessions


Ecological Momentary Assessment (EMA) through LifeData

📱 What is EMA?

We used LifeData's Real Life Exp app to deliver surveys to participant phones throughout the day for the entire study. This let us gather real-time data on a person's immediate mood, cravings, and emotions as they went about their daily life.

Participants chose a start time that fit with their schedule and received surveys 4 times a day. In order to continue to the intervention phase of the study, participants needed to respond to at least 75% of surveys during the baseline phase.

EMA Schedule: Participants receive surveys at two set times a day, and twice at any point during a survey window. For an 8am start time, these windows are between 8am and 12pm, and 2pm and 6pm, and the set times are at 1pm and 7pm.
8am start time schedule
A) 8am start time
10am start time schedule
B) 10am start time
Adapted from Muzekari, et al (2025). Naturalistic Tobacco Retail Exposure and Smoking Outcomes in Adults Who Smoke Cigarettes Daily JAMA Network Open, https://doi.org/10.1001/jamanetworkopen.2025.30132

Questions asked during ema

Geolocation Data

📍 Participant Geolocation

Participants' location data was collected through Google Maps Timeline and Location History feature. Participants were given detailed instructions based on their phone type to set up their location tracking. Participants would then download their data through Google Takeout and upload their location history through a Qualtrics survey. If participants had any technical issues they could schedule a help call to have a researcher walk them through all parts of the setup, download, and export of their data.

📍 Tobacco Retailer Locations

Tobacco retailer information is publicly available through open data sites for PA and DE, and updated tobacco retailer lists were downloaded by the study team monthly. Tobacco retailer information for NJ was available via request and was requested yearly by the study team.

The tobacco retailer lists from each state consisted of trade name, license number, license type, and street address. The study team constructed a custom codebase to pre-process the tobacco retailer data. For instance, one feature adds the latitude and longitude coordinates of the NJ tobacco retailers based on the provided street addresses, enabling further cross-referencing between tobacco retailer location and participant locations collected via Google Maps. License start and expiration dates were also generated based on tobacco retailers appearing or being removed in newly published databases.

In total, the custom database contained 36,580 tobacco retailers, including 23,293 in PA, 11,843 in NJ, and 1,444 in DE. In cases where the tobacco retailer location was incorrectly provided, but recovering the correct location was feasible, we manually obtained the corrected geolocation via Google Maps and updated the database. Examples of cases that prompted further investigation include a missing address, an address outside of the three states, a non-existent address, or many tobacco retailers with the same address.

🗺️ View Tobacco Retailer Database

Map of tobacco retailers

A map of all of the tobacco retailers in our database, across Pennsylvania, Delaware, and New Jersey

fMRI Data

🧠 Neuroimaging Data

fMRI data collected during tobacco cue reactivity tasks to measure neural responses to smoking and retail environment cues.

🧠 Neuroimaging Data Acquisition

Functional magnetic resonance imaging (fMRI) data were collected during tobacco cue reactivity tasks to measure neural responses to smoking and retail environment stimuli.

Scanner Specifications

Scanner: Siemens Prisma 3T
Institution: UPenn (SC3T)
Receive Coil: 32-channel head coil
Software: syngo MR E11

BOLD Sequence Parameters

Sequence: Multiband EPI-BOLD
Repetition Time (TR): 3.0 seconds
Echo Time (TE): 32 ms
Flip Angle: 90°
Field of View: 84 × 84 matrix

Spatial Resolution

Slice Thickness: 3.0 mm
Slice Spacing: 3.0 mm (no gap)
Number of Slices: 46 axial slices
Phase Encoding: Posterior → Anterior

Acquisition Details

Task Protocol: Image rating task
Acquisition Type: 2D multi-slice
Partial Fourier: 7/8 (87.5%)
Bandwidth: 2290 Hz/pixel

📋 Technical Notes

🎯 fMRI Task Design

Imaging Components: