Skip to content
The picture shows a young lion, symbolising our automated German text simplifier Simba.

Simba: Intelligent Text Simplification for a More Inclusive Internet

Berlin Research Institute Presents New Free AI Application to Reduce Language Barriers in Online Texts

 

Berlin, 09.09.2024. Simba has a clear mission: to make the internet more understandable for everyone. This web app, designed to simplify German-language texts, was developed by Freya Hewett, a researcher at the Alexander von Humboldt Institute for Internet and Society (HIIG). Simba offers two AI-powered solutions: a web app for simplifying personal texts and a browser extension that automatically summarises texts on websites. This reduces digital language barriers, enhances the reading experience and facilitates access to the German language.

Simplified Language for All

People use the internet every day to keep up to date with current affairs. However, many articles on websites and online publications are  difficult to understand because they have complex sentence structures and use technical jargon. According to Freya Hewett, a computational linguist at HIIG and the initiator of Simba, this particularly disadvantages individuals with learning difficulties or those learning German as a foreign language. “Our research shows that the websites of government bodies and institutions in the education and science sectors often exclude a significant proportion of the population from important information due to their complicated language,” Hewett explains. “Simplified language can help bridge these gaps. Participation in society depends on being able to access information and services online.”

Simba is the first free solution specifically designed to target end users, enabling them to simplify texts in their everyday life. Its AI-powered apps replace long words with shorter ones that have similar or identical meanings. They reduce sentence length and add additional information to make connections clearer. Until now, similar automated language-simplification solutions in Germany were fee-based and primarily used by institutions and businesses. For many people who need to have easily understandable texts, these services were inaccessible due to paywalls. Simba fills this gap by allowing everyone to simplify texts themselves. “Our goal is for Simba to become an everyday tool that helps all people who want to use text simplification in their daily lives”, emphasises Dr. Theresa Züger, head of the research group where the AI application was developed.

Public Interest-Oriented Artificial Intelligence and Transparency

Dr. Theresa Züger’s team explores what qualities artificial intelligence should have for the common good of society. These insights are implemented in practical prototypes, with Simba being the first to be released. You can find more information on this on the publicinterest.ai website.

One distinctive feature of Simba in this regard is its focus on common good. The AI app is operated on a non-commercial, not-for-profit basis, and the developers are available for inquiries. The source code and underlying models are openly accessible, allowing for transparent collaboration. Freya Hewett emphasises: “We want to collaborate with a community of researchers, inclusion experts and users to continuously develop and improve Simba. The target groups we address – such as people with learning difficulties or those who do not speak German as their first language – are very diverse. Our goal is to improve the language model through ongoing feedback and create simplifications that genuinely benefit many people.”

The Technology Behind Simba

Simba’s two applications are based on a text generation model. These models, also known as large language models or foundation models, are trained with large amounts of text data. They predict which word is most likely to come next in a sequence. Examples of such models include GPT-4, Mistral 7B and Llama. For Simba, the team used the Llama-3-8B-Instruct foundation model, which was fine-tuned with German-language newspaper articles.

“As with all text generation models, there is a possibility that automatically generated summaries may contain incorrect information,” explains Freya Hewett. “Nevertheless, we are confident that Simba provides valuable support.” Hewett advises carefully comparing the input and output text of the AI application to ensure the accuracy of the information.

Seeking Partners

Simba is available as a prototype free of charge for the time being, but the ongoing costs of operating such an AI application are significant. To ensure the continuous availability and development of Simba, HIIG is looking for additional partners. Companies and organisations are invited to get involved and support either the hosting or further development of the AI application. These partnerships aim to advance Simba’s mission and promote digital inclusion.

The beta version of Simba was developed as part of the “Public Interest AI” research group funded by the Federal Ministry of Education and Research (BMBF) at HIIG, under the leadership of Dr. Theresa Züger. Freya Hewett trained the language model and received technical implementation support from Hadi Asghari and Christopher Richter. Larissa Wunderlich designed the user interface and created the website.

Instructions for Simba

You can find a detailed guide on how to use the internet app and browser extension here. It is written in simplified language.

 

Press contact
Frederik Efferenn | Tel. +49 30 200 760 82 | presse@hiig.de


The Alexander von Humboldt Institute for Internet and Society

The Alexander von Humboldt Institute for Internet and Society (HIIG) researches the development of the Internet from a societal perspective in order to better understand the accompanying digitalisation of all areas of life. As the first research institute in Germany with a focus on Internet and society, HIIG has developed an understanding that emphasises the embedding of digital innovations in social processes. Based on this transdisciplinary expertise and as part of the Global Network of Interdisciplinary Internet & Society Research Centers, HIIG aims to develop a European response to digital structural change.

Frederik Efferenn

Head of Science Communication

SIMBA: SIMPLE AND ACCESSIBLE

Our AI applications help to understand German texts and summarise them in simple language. Our aim is to give as many different people as possible access to online texts and information.

Explore current HIIG Activities

Research issues in focus

HIIG is currently working on exciting topics. Learn more about our interdisciplinary pioneering work in public discourse.

Press Mailing List & media coverage

To the pressroom