Although a lot of the relevant data analysed by VerbaAlpina already exists (especially in atlases and dictionaries), there are also plans to collect more. The goal is to (1) find and correct inconsistencies within the pre-existing sources, (2) fill in the gaps and resolve inaccuracies, and (3) identify traditional terms and tools that have been passed down from generation to generation. The new data, however, won’t be collected with the classical field study methods, but rather using the means provided by social media. This method is usually referred to as crowdsourcing.
„Crowdsourcing ist eine interaktive Form der Leistungserbringung, die kollaborativ oder wettbewerbsorientiert organisiert ist und eine große Anzahl extrinsisch oder intrinsisch motivierter Akteure unterschiedlichen Wissensstands unter Verwendung moderner IuK-Systeme auf Basis des Web 2.0 einbezieht." (Martin/Lessmann/Voß 2008, translation: Crowdsourcing is a form of interactive service, organised competitively or collaboratively. It involves a large number of participants who are motivated extrinsically or intrinsically and whose knowledge on the subject can vary, and it is made possible thanks to the use of modern information and communication systems, based on the web 2.0).
The term crowd can be at times misleading, as it is often associated with arbitrariness, amateurishness, and lack of reliability. Those concerns are not entirely unjustified, as it is true that this method must rely on an unknown and anonymous amount of potential interested parties. Issues can arise both on the side of the scientific researcher supplying the project as well as on the side of the addressees (who can be unaffiliated to the project, but do not have to be): The supply has to be sufficiently visible and attractive and the addressee has to be knowledgeable enough in the field studied. There are multiple strategies available to approach that. The attractiveness of the project can be increased through games, thus making it entertaining. An example for this is the play4science. The experiences collected in the project above, however, also indicate that communicating to the addressee that the findings of the study will directly influence their language skills and expertise, is much more effective in making the supply enticing (see citizen science-Projekte). The competence of the addressees can be assessed through targeted tests, but it is without a doubt more effective to confirm the validity of data by comparing it with other people’s answers. A successful geolinguistics pilot-project which made use of crowdsourcing is Stephan Elspaß and Robert Möller‘s Atlas zur deutschen Alltagssprache (AdA); this project marks a milestone for the development of digial geolingustics.
The goal of VerbaAlpina is to transcribe data from printed sources, especially dictionaries and linguistic atlases, and collect them in a structured database; examine and possibly correct already available transcriptions; typecast and assign lexical lemmata to already transcribed data. Comments about things like the origin and spread of words are also welcome. VerbaAlpina is also really interested in current linguistic material, which has yet to be documented in written sources. Whoever possessed knowledge about a dialect in the Alpine region is therefore welcome to submit expressions used in those dialects and contribute to the VerbaAlpina database. Collecting various sources allows for both an enrichment of the already printed data, as well as an understanding of the dynamic processes involved in causing linguistic change. This works best if a lot of people are involved. Pictures of typically alpine objects, but also of pastures, huts, flora, fauna, mointains and langscapes are also welcome. They will be saved in the media library.
Alongside the collaboration with VerbaAlpina, each user has the opportunity to create their own research environment, in which they can collect language (or any other) data. The only condition is that the data remains georeferenced. They have the chance to encrypt the data and only save it for their personal use or share it with other users, in order to discuss and comment. The potential of the database therefore relies on a large amount of data being accessible to the public.
VerbaAlpina documents the vitality of the crowdsourcing-tools in a separate page. The experiences collected over the last two years using the crowdsourcing tools have shown that publicity is essential for a successful implementation of this method. Whenever the crowdsourcing tool is addressed in the public, crowd activity increases.
Alongside WordPress and the crowdsourcing module developed within the platform, VerbaAlpina also uses Zooniverse’s platform. Zoonivers is a Citizen Science Portal that approaches volunteers on the Internet to carry out specific tasks. The tool developed for this purpose can be found here: https://www.zooniverse.org/projects/filip-hr/verbaalpina/classify. The original idea was to use the free and already available crowdsourcing tool developed by Zooniverse and therefore save time to focus on timesensitive and elaborate in-house development. Furthermore, Zooniverse’s community already offered a profitable space to reach a large amount of volunteering crowders and therefore reach a quantitatively large amount of people.
Originally, the first phase of the AIS was meant to translate crowder transcriptions into maps and language atlases. During the course of the development, which was led primarily by Filip Hristov, however, it became clear that the original expectations towards Zooniverse would not have been met. Therefore, already the preparation of the Zooniverse-software proved to be more complicated than expected. In addition to that, Zooniverse-moderators also contributed and demanded changes to the concept and concrete set-up of the tool. All this delayed the launch of the tool to the 31st of March 2021.
Concerns regarding whether or not voluntary and inexperienced crowders would be capable of fullfilling the rather complex process of transcriptions were raised already during the development phase. This led to a change in the task description assigned to crowders. Transcription was introduced as an optional additonal task. The priority was instead shifted towards the manually more intuitive assignment of written words to numbers on the map. The numbers stood for other informants, who had uttered the words. In other words, crowders are asked to pick out text from a map by encompassing it in a rectangle and then assigning that text box to the correct informant number. The coordinates of the text boxes then get saved in the VerbaAlpina database. Thanks to this data, it is possible to automatically select and translate picture-data from a map into the OCR-program.



