|During a natural disaster or emergency event, individual social media posts or hot spots may not necessarily correlate
to the most devastated areas. To better understand the correlation between social media and physical damage, we
compare Tweets, data about the physical environment, and socio-economic factors with insurance claim information
(as a proxy for physical damage) from 2017 Hurricane Irma in the state of Florida. We use machine learning
to identify relevant Tweets, sensitivity analyses to identify socio-economic factors, and statistical regression to
determine the predictive capability of insurance claims as a proxy for damage. We find that Tweets alone result in a
poorly fitted regression model of insurance claims, but the inclusion of physical features (e.g., power outages, wind
level) and socio-economic factors (e.g., population density, education, Internet access) improves the model?s fit.
Such models contribute to the knowledge base that may allow social media to predict damage in real-time.