DIGITAL-2024-AI-06-LANGUAGE-01

Alliance for Language Technologies (SG)


About the connections

The topic graph was generated from the following links:

  • HORIZON-CL4-2024-HUMAN-03-01
    Advancing Large AI Models: Integration of New Data Modalities and Expansion of Capabilities (AI, Data and Robotics Partnership) (RIA)

    Motivation: DIGITAL-2024-AI-06-LANGUAGE-01 centres on providing data for the development of European large language models, which may also be used as part of the multimodal foundation models developed under HORIZON-CL4-2024-HUMAN-03-01.

Call text (as on F&T portal)

Expected Outcome:

Deliverables

  • Increased accessibility to language data for the development and adaptation of large language foundation models, in consideration of issues linked to data privacy and security, as well as potential risks of disinformation.
  • A repository of families of existing large language foundation models for public and industrial reuse in the EU.
  • A repository of families of large language models fine-tuned to specific languages, domains or industries.
  • Infrastructure and services for model fine-tuning.

Objective:

By federating Member States' efforts, this action will directly contribute to preserving linguistic and cultural diversity in Europe while effectively implementing the objectives of the European Common Data Infrastructure and Service MCP in the area of language technologies. By providing the necessary data and model adaptation capacities, the action will have a strong impact on the deployment of large language foundation models and their applications, such as generative AI. This federated effort will be established around two work strands.

First work strand – Data collection & Fine-Tuning

The first work strand will support the language data collection and the adaptation of existing large language foundation models to specific languages, domains or industries so as to support the onboarding of the latest language technologies by the European actors.

Scope:

Data:

Leveraging the Common European Language Data Space and other relevant Data Spaces, this activity will, in compliance with the applicable legislation (e.g. copyright law and the GDPR), gather the necessary language data (text, audio, image and other modalities) from a broad array of European industrial, academic and institutional actors. It will provide data of sufficient quality and quantity to build large language foundation models, ensuring coherent coverage of all the official languages of the Member States as well as the most socially and economically relevant ones. This will also include providing the data required to adapt such large language foundation models to specific languages, domains or industries. The action will also provide a repository of existing European large language foundation models as well as models adapted to specific languages, domains or industries. Once sufficiently advanced, the consortium may consider working on a future copyright infrastructure and related issues to allow efficient use of language and other data, while taking into account the interests of the rightsholders.

Fine-tuning:

  • This activity will also provide large language models fine-tuned to specific languages, domains or industries as a result of further training of large language foundation models on specific language data. This process involves adapting, evaluating and optimizing foundation models for specific languages, domains or industries. It will facilitate the efficient deployment of these models across various industries, requiring less task-specific data compared to building models from scratch, which is particularly advantageous for SMEs. The action will also include the support for the ongoing maintenance and enhancement of these models, ensuring their adaptability to evolving tasks and domains over time.
  • In addition, this activity will provide dedicated support and services, including through Financial Support to Third Parties and in particular for SMEs, to facilitate the fine-tuning of available models. These support services will give third parties an infrastructure to fine-tune and evaluate existing models for their own purposes.

The EuroHPC Joint Undertaking would provide access to its facilities for the adaptation and fine-tuning of the models when necessary. The consortium that will carry out this action should be composed of representatives of Member States; public and private organisations, SMEs and RTOs; entities with access to large compute capacities; and public and/or private data providers, such as the media or publishing industry.

News flashes

2024-07-31

For information on the evaluation results of this call, please consult the Flash call info (evaluation results).

2024-05-21

Please note that the call document has been amended to include a minor update under topic DIGITAL-2024-AI-06-FINETUNE (page 15 and page 25). This update DOES NOT modify in any way the topic conditions or the eligibility or evaluation criteria, and it is relevant only to that topic.

2024-02-29

The submission session is now available for: DIGITAL-2024-AI-06-IMAGING (DIGITAL-SME), DIGITAL-2024-AI-06-LANGUAGE-01 (DIGITAL-SIMPLE), DIGITAL-2024-AI-06-LANGUAGE-02 (DIGITAL-CSA), DIGITAL-2024-AI-06-FINETUNE (DIGITAL-SME).

Call topic details

Call status: Closed
Publication date: 2024-02-14
Opening date: 2024-02-29
Closing date: 2024-05-29
Procedure: single-stage

Budget: EUR 20,000,000
Expected grants: 0
Call

DIGITAL-2024-AI-06

Call topics are often grouped together in a call. Sometimes this is for a thematic reason, but often it is also for practical reasons.

There are 3 other topics in this call:

Source information

Showing the latest information. Found 5 versions of this call topic in the F&T portal.

Information from

  • 2025-02-11_03-20-10
  • 2024-11-23_03-20-11
  • 2024-09-30_21-21-11
  • 2024-07-04_15-02-40
  • 2024-03-30_14-27-07

Check the differences between the versions.

Call topic timeline

What phase of the topic timeline are we in? This timeline suggests realistic actions you could take at this moment and is based on the information provided by the call topic.
  1. Work programme available

    - 1 year ago

    The call topics are first published in the Work Programme, which is available some time before the call opens. By following the Work Programme publications, you can get a head start.

  2. Publication date

    - 1 year ago

    The call was published on the Funding & Tenders Portal.

  3. Opening date

    - 1 year ago

    The call opened for submissions.

  4. Closing date

    - 11 months ago

    Deadline for submitting a project.

  5. Time to inform applicants (estimate)

    - 6 months ago

    The maximum time to inform applicants (TTI) of the outcome of the evaluation is five months from the call closure date.

  6. Sign grant agreement (estimate)

    - 3 months ago

    The maximum time to sign grant agreements (TTG) is three months from the date of informing applicants.

  7. Today
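
The two estimated milestones in the timeline follow from simple date arithmetic on the closing date. The sketch below is a minimal illustration, assuming the closing date of 2024-05-29 given in the call topic details and reading "five months" and "three months" as whole calendar months. The helper add_months and the variable names are hypothetical, introduced only for this example, and the resulting dates are latest-possible estimates rather than dates announced by the EC.

```python
import calendar
from datetime import date


def add_months(start: date, months: int) -> date:
    """Shift `start` by whole calendar months.

    The day is clamped to the last day of the target month,
    so 31 January plus one month gives the last day of February.
    """
    month_index = start.month - 1 + months
    year = start.year + month_index // 12
    month = month_index % 12 + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(start.day, last_day))


# Closing date as listed for this call topic.
closing_date = date(2024, 5, 29)

# Maximum durations quoted in the timeline (assumed to be calendar months).
TTI_MONTHS = 5  # time to inform applicants
TTG_MONTHS = 3  # time to sign grant agreements

inform_deadline = add_months(closing_date, TTI_MONTHS)
grant_deadline = add_months(inform_deadline, TTG_MONTHS)

print("Latest date to inform applicants:", inform_deadline.isoformat())  # 2024-10-29
print("Latest date to sign grants:      ", grant_deadline.isoformat())   # 2025-01-29
```

Under these assumptions, applicants would be informed by 2024-10-29 at the latest and grant agreements signed by 2025-01-29 at the latest.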

Funded Projects

Project information comes from CORDIS (for Horizon 2020 and Horizon Europe) and will be sourced from F&T Portal (for Digital Europe projects)