|
Anna Kruspe, Jens Kersten, & Friederike Klan. (2019). Detecting event-related tweets by example using few-shot models. In Z. Franco, J. J. González, & J. H. Canós (Eds.), Proceedings of the 16th International Conference on Information Systems for Crisis Response And Management. Valencia, Spain: Iscram.
Abstract: Social media sources can be helpful in crisis situations, but discovering relevant messages is not trivial. Methods
have so far focused on universal detection models for all kinds of crises or for certain crisis types (e.g. floods).
Event-specific models could implement a more focused search area, but collecting data and training new models for
a crisis that is already in progress is costly and may take too much time for a prompt response. As a compromise,
manually collecting a small amount of example messages is feasible. Few-shot models can generalize to unseen
classes with such a small handful of examples, and do not need be trained anew for each event. We show how
these models can be used to detect crisis-relevant tweets during new events with just 10 to 100 examples and
counterexamples. We also propose a new type of few-shot model that does not require counterexamples.
|
|
|
Jens Kersten, Jan Bongard, & Friederike Klan. (2022). Gaussian Processes for One-class and Binary Classification of Crisis-related Tweets. In Rob Grace, & Hossein Baharmand (Eds.), ISCRAM 2022 Conference Proceedings – 19th International Conference on Information Systems for Crisis Response and Management (pp. 664–673). Tarbes, France.
Abstract: Overload reduction is essential to exploit Twitter text data for crisis management. Often used pre-trained machine learning models require training data for both, crisis-related and off-topic content. However, this task can also be formulated as a one-class classification problem in which labeled off-topic samples are not required. Gaussian processes (GPs) have great potential in both, binary and one-class settings and are therefore investigated in this work. Deep kernel learning combines the representative power of text embeddings with the Bayesian formalism of GPs. Motivated by this, we investigate the potential of deep kernel models for the task of classifying crisis-related tweet texts with special emphasis on cross-event applications. Compared to standard binary neural networks, first experiments with one-class GP models reveal a great potential for realistic scenarios, offering a fast and flexible approach for interactive model training without requiring off-topic training samples and comprehensive expert knowledge (only two model parameters involved).
|
|