Skip to main content

The Learning Curve, Part 1: Why Teaching AI New Languages Begins with Data



As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter and Note Assist. But what does AI language development involve? This series examines the challenges of working with mobile AI and how we overcame them. First up, we head to Indonesia to learn where one begins teaching AI to speak a new language.
 




The first step is establishing targets, according to the team at Samsung R&D Institute Indonesia (SRIN). “Great AI begins good quality and relevant data. Each language demands a different way to process this, so we dive deep to understand the linguistic needs and the unique conditions of our country,” says Junaidillah Fadlil, head of AI at SRIN, whose team recently added Bahasa Indonesia (Indonesian language) support to Galaxy AI. “Local language development has to be led by insight and science, so every process for adding languages to Galaxy AI starts with us planning what information we need and can legally and ethically obtain.” 

Galaxy AI features such as Live Translate perform three core processes: automatic speech recognition (ASR), neural machine translation (NMT) and text-to-speech (TTS). Each process needs a distinct set of information.

 

ASR, for instance, needs extensive recordings of speech in numerous environments, each paired with an accurate text transcription. Varying background noise levels help account for different environments. “It’s not enough just to add noises to recordings,” explains Muchlisin Adi Saputra, the team’s ASR lead. “In addition to the language data we obtained from authorized 3rd party partners, we must go out into coffee shops or working environments to record our own voices. This allows us to authentically capture unique sounds from real life, like people calling out or the clattering of keyboards.”

 

The ever-changing nature of languages must also be considered. Saputra adds: “We need to keep up to date with the latest slang and how it is used, and mostly we find it on social media!”

Next, NMT requires translation training data. “Translating Bahasa Indonesia is challenging,” says Muhamad Faisal, the team’s NMT lead. "Its extensive use of contextual and implicit meanings relies on social and situational cues, so we need numerous translated texts that the AI could reference for new words, foreign words, proper nouns, and idioms – any information that helps AI understand the context and rules of communication.”
 

TTS then requires recordings that cover a range of voices and tones, with additional context on how parts of words sound in different circumstances. “Good voice recordings could do half the job and cover all the required phonemes (units of sound in speech) for the AI model,” adds Harits Abdurrohman, TTS lead. “If a voice actor did a great job in the earlier phase, the focus shifts to refining the AI model to clearly pronounce specific words.” 

 

Stronger Together


It takes vast resources to plan for much data, and SRIN worked closely with linguistics experts. “This challenge requires creativity, resourcefulness and expertise in both Bahasa Indonesia and machine learning,” Fadlil reflects. “Samsung’s philosophy of open collaboration played a big part in getting the job done, as did our scale of operations and history of AI development.”

Working with other Samsung Research centers around the world, the SRIN team was able to quickly adopt best practices and overcome the complexities of establishing data targets. Furthermore, collaboration was good for advancing not only technology but also culture. When the SRIN team joined their counterparts in Bangalore, India, they observed the local fasting customs, creating deeper connections and expanding their understanding of different cultures.
 

For the team, Galaxy AI’s language expansion project took on a new significance. “We are particularly proud of our achievements here as this was our first AI project, and it won’t be our last as we continue to refine our models and improve the quality of output,” Fadlil concludes. “This expansion not only reflects our values of openness but also respects and incorporates our cultural identities through language.”
 

In the next episode of The Learning Curve, we will head to Samsung R&D Institute Jordan to speak to the team who led Galaxy AI’s Arabic language project. Tune in to learn about the complexities of building and training an AI model for a language with diverse dialects.


Comments

Popular posts from this blog

ASEAN Centre for Energy (ACE) to Guide Regional Energy Transition through the Launch of the 8th ASEAN Energy Outlook (AEO8)

Photo 1. (left-right), Beni Suryadi, Acting Executive Director of ACE, H.E. Phosay Sayasone, Minister of Energy and Mines, Lao PDR, Dato’ Mohamad Razif bin Haji Abdul Mubin, Deputy Secretary General (Energy) at the Ministry of Energy Transition and Water Transformation (PETRA), Malaysia , Dr Zulfikar Yurnaidi, Head of Energy Modelling and Policy Planning (MPP) at ACE, with AEO8 at the 42nd AMEM and Its Associated Meetings Press Conference Vientiane, 30 September 2024 – The ASEAN Centre for Energy (ACE) is delighted to announce the launch of the 8th edition of The ASEAN Energy Outlook (AEO8) during the 42nd ASEAN Ministers on Energy Meeting (AMEM) and Associated Meetings, held in conjunction with the 24th ASEAN Energy Business Forum (AEBF-24) in Vientiane, Lao PDR. AEO8 is a flagship report, developed by ACE in close collaboration with the ASEAN Member States and partners, that supports the implementation of the ASEAN Plan of Action for Energy Cooperation (APAEC). The report provides a ...

AEON Bank and Malaysia Esport Federation Signed MoU to Nurture E-Sports Talents Among Local Youth

 AEON Bank (M) Berhad and Malaysia Esports Federation (MESF) recently signed a Memorandum of Understanding (MoU), alongside two other strategic partners—Takaful Malaysia (Syarikat Takaful Malaysia Am Berhad) and Retromania. The MoU signing took place at Pavilion Bukit Jalil’s Exhibition Centre, with the signatories being Tsutomu Kamagata, Deputy CEO and Chief Corporate Management Officer of AEON Bank; YBrs. Naim Al-Amin, President of MESF; Hazruddin Hassan, Chief Marketing Officer of Takaful Malaysia; and an official representative from Retromania. The MoU signing ceremony was witnessed by YBrs. Suhaidi Alias, Sports Commissioner Malaysia; Afiq Fadhli Narawi, Vice President of MESF; Aizuddin Danian Izham Cheong, Chief Personal Banking Officer of AEON Bank (M) Berhad; and YBrs. Ahmed Faris Amir, CEO of Esports Integrated (ESI). The MoU signing was preceded by the launch of Piala Esports Malaysia and was followed by the full acceptance and commitment towards strict compliance to the ...

Celebrating Health and Wellness: A Reflection on the Time to Move Program

Samsung Malaysia Electronics, in collaboration with Sunway Medical Centre, Sunway City, came together to celebrate the remarkable achievements of participants in the Time to Move program. This initiative has provided an incredible opportunity for individuals to embark on a wellness journey, creating many inspiring stories. Throughout the course of the program, many participants have made significant strides in their efforts to improve their health. Their stories of perseverance and commitment to bettering themselves are a testament to the power of dedication. Participants not only reached their goals but also learnt valuable lessons on how to become healthier and more mindful of their well-being. In supporting their journeys, Samsung Health is a powerful resource of information for participants to make informed decisions about their health. Features such as Energy Score and Wellness Tips empowered participants to stay on track, making their wellness journeys more manageable and effecti...