Dongyu Zhang

Professor of Computational Linguistics and Natural Language Processing
School of Foreign Languages and the School of Software Technology
Dalian University of Technology, Dalian, China

Research Areas

Computational Linguistics and NLP

Building algorithms to understand and process human language, including translation, speech recognition, and text analysis.

Metaphor Computing

Studying computational models to process metaphorical and abstract language, improving AI’s interpretative abilities.

Sentiment Analysis

Utilizing AI to identify and classify emotional tone in text for applications in social media, reviews, and healthcare.

AI for Health

Using AI to improve diagnostics, predict diseases, and enhance healthcare outcomes using insights from real-time data.

AI for Education

Leveraging AI to enhance educational processes, personalize learning, and improve student outcomes through adaptive technologies.

Neurolinguistics

Using AI to explore how the brain processes language, contributing to a better understanding of cognitive functions.

Datasets

MultiMET: A Multimodal Dataset for Metaphor Understanding

MultiMET is a novel multimodal metaphor dataset designed to aid in understanding metaphorical information from text and images. It includes 10,437 text-image pairs from various sources, with multimodal annotations for metaphor occurrences, domain relations, conveyed sentiments, and author intents.

MultiCMET: A Novel Chinese Benchmark for Understanding Multimodal Metaphor

MultiCMET is a multimodal Chinese metaphor dataset comprising 13,820 text-image pairs from advertisements, with manual annotations for metaphor occurrences, source and target domain categories, and the sentiments conveyed by the metaphors.

Language Understanding Corpus

A large-scale dataset for training models in language understanding, including text, audio, and multimodal sources.

Metaphor Analysis Data

Annotated data focusing on metaphorical language, aiding in training AI to recognize and interpret metaphors.

Sentiment Classification Set

A labeled text dataset for training sentiment analysis algorithms, with positive, neutral, and negative classes.
