


Abstract:

We share MeraLion (Multimodal Empathetic Reasoning and Learning In One Network), our multimodal generative AI effort under Singapore’s National Multimodal Large Language Model Programme. Speech and audio modeling uncovers spatial, temporal, and social dynamics that remain invisible to text-based models, making it essential for richer, more grounded understanding. The interplay of cultural nuance and multilingualism further deepens the complexity of interpreting human intentions, behavior, and interactions. Empathetic reasoning is a prime example of synergistic human-AI collaboration, with applications spanning education, healthcare, embodied learning and robotics, and policymaking. We highlight recent research endeavors, technology deployment experience, and application opportunities, including the SingaKids AI Tutor, which encourages young children to learn.
