I'm sorry but if you use Mastodon posts to train systems for filtering inappropriate content you will lose all eye contact, alcohol, US politics, and Star Wars discussions from your platform...
(This is from the paper "Mastodon Content Warnings: Inappropriate Contents in a Microblogging Platform" published by the University of Milan)
@Gargron Serious question: Does the corpus contain the original CW text?
But I guess if question is "What kinds of things do Masto users put in CWs?", then it doesn't matter
@Gargron It's unbelievable as a normal process of cognition is made impossible by that wall my brain erects each time the words "policy", "Italy" and "social network" appear together.
Server run by the main developers of the project It is not focused on any particular niche interest - everyone is welcome as long as you follow our code of conduct!