EmpowerYou - Huawei Tech4City Competition

Overview

This project aims to develop a system for generating meeting analysis for meeting attendees using multimodal inputs. The system extracts features from speech and images to classify personal traits, ultimately generating attendee-specific reports.

Input Types

Speech Signal
Image

Workflow

First Stage: Multimodal Feature Extraction

Automatic Speaker Recognition: Processes the speech signal to identify and recognize the speaker.
Body Keypoints Detection: Analyzes images to detect key points on the body, which are critical for understanding body language.
Speaker Diarization: Separates and identifies speech segments according to different speakers within the meeting context.

Second Stage: Personal Traits Analysis

BERT-based Personal Traits Classification: Uses a BERT model to classify personal traits based on the recognized speaker's voice.
Body Language Classification: Utilizes the detected body keypoints to classify the body language, providing insights into non-verbal cues.

Last Stage: Report Generation

Small Language Model: Integrates the personal traits information derived from speech and body language analyses to generate a comprehensive report.
Meeting Attendee-specific Report: Compiles all analyzed data into detailed, attendee-specific reports which can be used for further review and analysis.

Outcome

The system provides detailed, individualized reports for each meeting attendee, offering insights based on multimodal analysis of their speech and body language. This aids in understanding personal traits and behaviors during meetings, facilitating better communication and interaction strategies.

Usage

Data Input: Provide speech signals and images of meeting attendees.
Run Workflow: Execute the provided code to process multimodal inputs, analyze personal traits, and generate reports.
Review Reports: Access the generated attendee-specific reports for insights into meeting dynamics and attendee behaviors.

Dependencies

Find in requirements.txt

Contributors

Yan Jinjiang, Ananya Varshney, George Ong, Cheo Le Xian, Ninad Dixit, Wong Yen Heng

Notes

This repository mainly shows the lowest hanging fruit of the development, it is far from optimal. If the team gets into semi-final round, this repository will be improved and scaled to production level.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
.idea		.idea
bert_personality		bert_personality
body_language		body_language
whisper		whisper
README.md		README.md
main.py		main.py
requirement.txt		requirement.txt
small_lm.py		small_lm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EmpowerYou - Huawei Tech4City Competition

Overview

Input Types

Workflow

First Stage: Multimodal Feature Extraction

Second Stage: Personal Traits Analysis

Last Stage: Report Generation

Outcome

Usage

Dependencies

Contributors

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

EmpowerYou - Huawei Tech4City Competition

Overview

Input Types

Workflow

First Stage: Multimodal Feature Extraction

Second Stage: Personal Traits Analysis

Last Stage: Report Generation

Outcome

Usage

Dependencies

Contributors

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages