Nam-Soo Kim Laboratory

Seoul National University, Department of Electrical and Computer Engineering

Data collection status: 75%

Papers in the last 5 years: 1

Members: 17

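Basic Information

University website

Nam-Soo Kim (김남수)

Seoul National University, Department of Electrical and Computer Engineering

Artificial intelligence, speech and audio signal processing

Lab website: accessible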

nkim@snu.ac.kr

Lab Introduction

Established in 1998 and directed by Prof. N. S. Kim, the Human Interface Laboratory conducts research on speech and audio signal processing. Ongoing research topics include speech recognition, speech synthesis, speech enhancement, realistic acoustics, acoustic event detection, audio source separation, and audio source localization, using techniques from machine learning.

Research Areas

University website

Speech Signal Processing

Automatic Speech Recognition, Speech Synthesis, Speech Enhancement, Speech Coding

Core research area covering automatic speech recognition, speech synthesis, speech enhancement, and speech coding for human-machine interface systems.

Automatic Speech Recognition

ASR, speech-to-text, human-machine interface

The task of converting a speech utterance into text. ASR is a core technique of human-machine interface systems, with applications in smart home and phone interfaces.

Speech Synthesis

text-to-speech, voice synthesis, speech generation

A technique for synthesizing speech from text input, actively used in smartphone interfaces, personal assistants, ARS (automated response systems), and robot interfaces.

Speech Enhancement

speech quality, noise reduction, audio processing

Improves the intelligibility and quality of degraded speech using audio signal processing techniques. Applications include mobile systems, hearing aids, and ASR.

Speech Coding

speech compression, audio coding, data compression

Applies data compression to digital audio signals containing speech in order to minimize transmission bandwidth or reduce storage costs.

AI & Machine Learning

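Artificial Intelligence, Machine Learning, DNN, NMF, HMM, SVM

Research on implementing algorithms for machine learning applications in speech and audio signal processing, including hidden Markov models (HMM), support vector machines (SVM), deep neural networks (DNN), and non-negative matrix factorization (NMF).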

Artificial Intelligence

machine learning, dialogue systems, auditory scene understanding

Implements algorithms that allow machines to perform human-like tasks, including automatic speech recognition, machine hearing, dialogue systems, and auditory scene understanding.

Machine Learning

data analysis, pattern recognition, learning algorithms

Provides computers with the ability to learn from and analyze data without explicit programming. Speech and audio signal processing is a leading field in the use of machine learning techniques.

Audio Signal Processing

Realistic Audio Technology, Audio Scene Recognition, Acoustic Localization, Sound Code, Audio Source Separation

Research area covering realistic audio technology, audio scene recognition, acoustic localization, sound code, and audio source separation.

Realistic Audio Technology

spatial audio, 3D sound, room acoustics

Method for reproducing spatial sound using recorded anechoic sound sources and measured room impulse responses. Applications include 3D realistic audio systems.
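As a rough illustration of the idea described above, the sketch below convolves a dry (anechoic) signal with a room impulse response in plain Python. The function name `apply_room_response` and the toy sample values are hypothetical and do not come from the lab's materials; a practical system would presumably use measured impulse responses and a faster FFT-based convolution.

```python
from typing import List, Sequence


def apply_room_response(dry: Sequence[float], rir: Sequence[float]) -> List[float]:
    """Convolve an anechoic signal with a room impulse response (direct form).

    Hypothetical helper for illustration only: `dry` is the anechoic
    recording and `rir` is the impulse response, both as sample lists.
    """
    if not dry or not rir:
        return []
    # Full linear convolution has len(dry) + len(rir) - 1 samples.
    out = [0.0] * (len(dry) + len(rir) - 1)
    for n, x in enumerate(dry):
        for k, h in enumerate(rir):
            # Each input sample excites a scaled, shifted copy of the response.
            out[n + k] += x * h
    return out


if __name__ == "__main__":
    # Toy values (assumed): a direct path plus one reflection two samples later.
    dry_signal = [1.0, 2.0]
    room_response = [1.0, 0.0, 0.5]
    print(apply_room_response(dry_signal, room_response))
```

The direct double loop keeps the sketch dependency-free; its cost grows with the product of the two lengths, which is why long impulse responses are usually handled in the frequency domain.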

Audio Scene Recognition

sound event detection, acoustic environment, audio classification

Computational analysis of the acoustic environment and recognition of distinct sound events, focusing on recognizing context and analyzing discrete sound events.

Acoustic Localization

sound localization, source tracking, location-based services

Studies localization or tracking of acoustic sources based on sound field measurements. Applications include virtual tour guides, item tracking, and shopping mall navigation.
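The page does not say which localization method the lab uses. One common building block, shown here purely as an assumed example, is estimating the time difference of arrival between two microphones by cross-correlation. The function names, the toy signals, and the 343 m/s speed of sound (roughly air at 20 °C) are all illustrative assumptions.

```python
from typing import Sequence


def estimate_delay(ref: Sequence[float], delayed: Sequence[float], max_lag: int) -> int:
    """Return the lag in samples at which `delayed` best matches `ref`.

    A positive result means the sound reached the second microphone later.
    Hypothetical helper: brute-force cross-correlation over [-max_lag, max_lag].
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, r in enumerate(ref):
            j = i + lag
            if 0 <= j < len(delayed):
                score += r * delayed[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_to_path_difference(lag: int, sample_rate: float,
                             speed_of_sound: float = 343.0) -> float:
    """Convert a sample lag to a path-length difference in metres."""
    return lag / sample_rate * speed_of_sound


if __name__ == "__main__":
    mic_a = [0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
    mic_b = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0]  # same pulse, two samples later
    lag = estimate_delay(mic_a, mic_b, max_lag=3)
    print(lag, delay_to_path_difference(lag, sample_rate=16000.0))
```

A single delay only constrains the source to a curve of possible positions; combining delays from several microphone pairs is what narrows it to a location.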

Sound Code

data hiding, wireless transmission, audio encoding

A wireless data transmission system that encodes data using audio data hiding technology. Applications include inserting product information and coupon codes into advertisements.

Audio Source Separation

source separation, signal extraction, noise removal

A technique for extracting one or more signals of interest from mixture signals, removing unwanted components from recordings.

Source: lab homepage

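Student Composition

University website, 1 week ago

Current students: 0

Graduated in the last 5 years: 0

Student information is being collected; once the counts have been aggregated, the distribution will be shown in this section.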

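This page provides only aggregate statistics (number of members, career categories, degree program distribution) to convey the size of the lab; it does not display individual students' names, previous institutions, or employers. The degree program distribution is shown only when every current student's program is clearly classified (it is hidden if even one student is unclassified), and it is published only when the k≥5 anonymity condition is met (PIPA §58-2 and §28-2, and Supreme Court decision 2014다235080).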
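The display rule above could be sketched roughly as follows. This is only an illustration, not the site's actual code: the function name, the degree labels, and in particular the reading that k≥5 applies to the total number of current students (rather than to each category) are assumptions.

```python
from collections import Counter
from typing import Dict, List, Optional

K_THRESHOLD = 5  # the k≥5 anonymity condition stated on the page


def degree_distribution(programs: List[Optional[str]],
                        k: int = K_THRESHOLD) -> Optional[Dict[str, int]]:
    """Return per-program counts, or None when the distribution must be hidden.

    `programs` holds one entry per current student; None marks a student
    whose degree program could not be classified (hypothetical encoding).
    """
    # A single unclassified student hides the whole distribution.
    if any(p is None for p in programs):
        return None
    # Assumption: k is checked against the total number of students.
    if len(programs) < k:
        return None
    return dict(Counter(programs))


if __name__ == "__main__":
    print(degree_distribution(["MS", "PhD", "PhD", "MS", "PhD"]))
    print(degree_distribution(["MS", "PhD", None, "MS", "PhD", "PhD"]))
```

Returning `None` instead of an empty dictionary lets the caller tell "hidden for privacy" apart from "shown, but no students".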

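Recent Papers

OpenAlex, 1 month ago

Novel Deep Learning-Based Vocal Biomarkers for Stress Detection in Koreans

Psychiatry Investigation (journal)

November 18, 2024, cited 6 times

This study highlights the feasibility of voice-based mental stress assessment in Koreans and points to the importance of continued research on vocal biomarkers across diverse linguistic populations.

AI-generated summary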

Paper Trends

OpenAlex

Research Keywords

Research keywords will be extracted automatically once paper data has been collected.