vera.ai
vera.ai is a research and development project focusing on disinformation analysis and AI-supported verification tools and services.
Online disinformation and fake media content have emerged as a serious threat to democracy, the economy and society.
Recent advances in AI have enabled the creation of highly realistic synthetic content and its artificial amplification
through AI-powered bot networks. Consequently, it is extremely challenging for researchers and media professionals to
assess the veracity and credibility of online content and to uncover highly complex disinformation campaigns.
vera.ai seeks to build professional, trustworthy AI solutions against advanced disinformation techniques, co-created with
and for media professionals and researchers, and to lay the foundation for future research in the area of AI against
disinformation.
Key novel characteristics of the AI models will be fairness, transparency (incl. explainability), robustness against concept
drift, continuous adaptation to disinformation evolution through a fact-checker-in-the-loop approach, and the ability to
handle multimodal and multilingual content. Recognising the perils of AI-generated content, we will develop tools for
deepfake detection in all formats (audio, video, image, text).
vera.ai adopts a multidisciplinary co-creation approach to AI technology design, coupled with open-source algorithms.
A key unique proposition is the grounding of the AI models in continuously collected fact-checking data gathered from the
tens of thousands of instances of “real life” content being verified in the InVID-WeVerify plugin and the Truly Media/EDMO
platform. Social media and web content will be analysed and contextualised to expose disinformation campaigns
and measure their impact.
Results will be validated by professional journalists and fact-checkers from project partners (DW, AFP, EUDL, EBU),
external participants (through our affiliation with EDMO and seven EDMO Hubs), the community of more than
53,000 users of the InVID-WeVerify verification plugin, and by media literacy, human rights and emergency response
organisations.
Assets related to vera.ai
Fraunhofer-Gesellschaft
Athens Technology Center
RINE: Leveraging Representations from Intermediate Encoder-Blocks for Synthetic Image Detection
A high-performing synthetic image detection method that utilises intermediate layers of the CLIP image encoder.
IDMT Audio Phylogeny Dataset
Datasets containing audio phylogeny trees for evaluating audio phylogeny algorithms.
ODSS: An Open Dataset of Synthetic Speech
A multilingual, multispeaker dataset of synthetic and natural speech, designed to foster research on and benchmarking of synthetic speech detection.
Kempelen Institute of Intelligent Technologies
Keyframe Selection and Enhancement online service
This experimental AI-powered service automatically extracts representative keyframes from a video. The service also identifies and improves the resolution of faces and text detected within the keyframes for better clarity.
General Claim dataset
The General Claim dataset is a diverse, harmonized dataset created for the task of check-worthy claim detection, addressing the limitations of narrow, specialized datasets currently used in the field. Constructed from five pre-existing datasets, it emphas...
General Claim detection model - mDeBERTa V3
An mDeBERTa V3 model trained during experiments aimed at creating models capable of detecting check-worthy claims across the widest achievable range.
General Claim detection model - XLM-RoBERTa
An XLM-RoBERTa model trained during experiments aimed at creating models capable of detecting check-worthy claims across the widest achievable range.
General Claim detection model - LESA
LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content
VERITE: A Dataset for Image-Text Verification
A robust benchmark for multimodal misinformation detection, including out-of-context and miscaptioned images, that mitigates unimodal biases.
CREDULE: A Dataset for Evidence Verification
A dataset for improving automated fact-checking by addressing leaked and unreliable web-collected evidence.