Skip to content

Commit

Permalink
Add files via upload
Browse files Browse the repository at this point in the history
  • Loading branch information
ky219 authored Apr 9, 2024
1 parent f0bf2e5 commit 3d6e7f9
Show file tree
Hide file tree
Showing 2 changed files with 20 additions and 20 deletions.
24 changes: 12 additions & 12 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -70,10 +70,10 @@ <h3>Biography</h3>
I am currently a distinguished professor in the Department of Computer Science and Engineering at Shanghai Jiao Tong University (SJTU), as well as the co-founder and chief scientist of AISpeech. I am now leading the Institute of Intelligent Human-Computer Interaction of the Department of Computer Science, as well as the Center for Intelligent Speech and Natural Language Processing of the AI Institute of SJTU.
</p>
<p>
My academic journey began at the Department of Automation at Tsinghua University, where I completed my bachelor's and master's degrees in 1999 and 2002 respectively. I obtained my PhD at the Machine Intelligence Lab of the Engineering Department, Cambridge University, U.K. in 2006 and then worked as a senior research associate there. I joined SJTU in 2012 and founded <i>SpeechLab</i> at SJTU. Later, SpeechLab is extended and renamed as <i><a style="text-decoration:none" href="https://x-lance.sjtu.edu.cn/" target="_blank">Cross-media Language Intelligence (X-LANCE) Lab</a></i> as it is now.
My academic journey began at the Department of Automation at Tsinghua University, where I completed my bachelor and master degrees in 1999 and 2002 respectively. I obtained my PhD at the Machine Intelligence Lab of the Engineering Department, Cambridge University, U.K. in 2006 and then worked as a senior research associate there. I joined SJTU in 2012 and founded <i>SpeechLab</i> at SJTU. Later, SpeechLab is extended and renamed as <i><a style="text-decoration:none" href="https://x-lance.sjtu.edu.cn/" target="_blank">Cross-media Language Intelligence (X-LANCE) Lab</a></i> as it is now.
</p>
<p>
My research interests primarily lie in the field of conversational AI, including rich aspects of speech and language processing as well as multi-modal linguistic computing. The goal of my research is to build cognitive conversational agent which can operate in complex real-world environment, deal with uncertainty, deliver information in a humanized way and evolve via interacting with environment. I have published over 200 peer-reviewed journal and conference papers and won numerous paper awards. I used to serve as program chairs for InterSpeech, ICMI and SigDial, as well as area chairs of speech processing or dialogue systems for InterSpeech, ACL, EMNLP etc.
My research interests primarily lie in the field of conversational AI, including rich aspects of speech and language processing as well as multi-modal linguistic computing. The goal of my research is to build cognitive conversational agent which can operate in complex real-world environment, deal with uncertainty, deliver information in a humanized way and evolve via interacting with environment. I have published over 200 peer-reviewed journal and conference papers and won numerous paper awards. I used to serve as program chairs for Interspeech, ICMI and SigDial, as well as area chairs of speech processing or dialogue systems for Interspeech, ACL, EMNLP etc.
</p>
<p>
The outcome of my research have been both recognized in academia and successfully industrialized. I founded AISpeech to commercialize state-of-the-art speech and language processing technology. AISpeech has been selected into the “AI Key Players” list in the Equity Research Report of AI by Goldman Sachs in 2016 and one of the Cool Vendors for AI (East Asia) by Gartner in 2017. On behalf of AISpeech, I am also leading the National AI Open Innovation Platform on Language Computing, granted by Ministry of Science and Technology of China in 2022.
Expand All @@ -83,13 +83,13 @@ <h3>Biography</h3>
<hr>

<h3> SJTU X-LANCE Lab </h3>
&nbsp;&nbsp;&nbsp;&nbsp; <font color="DarkRed"><i>We are looking for self-motivated Ph.D./master/undergraudate students and postdocs interested in speech and language processing. Please send your CV to me if you want to join
&nbsp;&nbsp;&nbsp;&nbsp; <font color="DarkRed"><i>We are looking for self-motivated Ph.D./master/undergraduate students and postdocs interested in speech and language processing. Please send your CV to me if you want to join
us. </i></font><br/>


<h4>Research Interests</h4>
<ul>
<li> <i> Speech Processing: </i> neural speech sigal processing, robust speech and speaker recognition, high-fidelity speech synthesis, audio analysis and auditory cognition, multi-modal speech processing and universal audio model </li>
<li> <i> Speech and Audio Processing: </i> neural speech signal processing, robust speech and speaker recognition, high-fidelity speech synthesis, audio analysis and auditory cognition, multi-modal speech processing and universal audio model </li>
<li> <i> Natural Language Processing: </i> structured language understanding, KBQA and machine reading comprehension, statistical dialogue systems, multi-lingual language processing, foundation language model, large language model agent </li>
<li> <i> Multi-modal interaction: </i> digital avatar, GUI understanding and manipulation, AGI for science </li>
</ul>
Expand All @@ -112,7 +112,7 @@ <h3>Selected Publication <a class="grey" href="https://scholar.google.com/citati

<!-- </td></tr></table> -->

<h4>Speech Processing</h4>
<h4>Speech and Audio Processing</h4>
<ul>
<li>
<p><span class="tag blue-tag">ASR</span> <b>TDT-KWS: Fast and Accurate Keyword Spotting Using Token-and-duration Transducer</b><br/>
Expand Down Expand Up @@ -215,8 +215,8 @@ <h3>Professional Qualification and Service </h3>
<h4>Institute of Electrical and Electronics Engineers (IEEE)</h4>
<ul>
<li> Senior member of IEEE </li>
<li> Board Member of IEEE Signao Processing Society Conferences Board </li>
<li> Board Member of IEEE Signao Processing Society Membership Board </li>
<li> Board Member of IEEE Signal Processing Society Conferences Board </li>
<li> Board Member of IEEE Signal Processing Society Membership Board </li>
<li> Member of IEEE Speech and Language Processing Technical Committee (2017-2019) </li>
<li> Associate Editor of IEEE/ACM Transactions on Audio Speech and Language Processing </li>
</ul>
Expand All @@ -237,7 +237,7 @@ <h4>Chinese Information Processing Society of China (CIPSC)</h4>

<h4>Industry Service</h4>
<ul>
<li> Director of the National AI Open Innovation Platform on Language Computing, Ministry of Science and Technology of China </li>
<li> Director of the National AI Open Innovation Platform on Language Computing, Ministry of Science and Technology of China (MOST) </li>
<li> Member of the AI Key Technology and Application Evaluation Academic Committee of the Key Laboratory of the Ministry of Industry and Information Technology of China </li>
<li> Member of the Information System User Interfaces Branch (TC28/SC35) of the National Information Technology Standardization Technical Committee </li>
<li> Director of the Academic and Intellectual Property Working Group of the China Artificial Intelligence Industry Alliance (AIIA) </li>
Expand All @@ -255,7 +255,7 @@ <h4>Academic Conference Service</h4>
<ul>
<li> <b>ICASSP</b> </li>
<ul><li> IEEE SLTC Member </li></ul>
<li> <b>InterSpeech</b> </li>
<li> <b>Interspeech</b> </li>
<ul><li> Program Chair, Area Chair (Speech Recognition/Dialogue Systems) </li></ul>
<li> <b>EUSIPCO</b></li>
<ul><li> Area chair (Speech Processing) </li></ul>
Expand Down Expand Up @@ -287,12 +287,12 @@ <h4> Reviewer Service </h4>
<li> Journal of Automation (Chinese) </li>
</ul>
<li> <b> Conference </b> </li>
<ul><li> ICASSP, InterSpeech, IEEE ASRU, IEEE SLT, APSIPA, ISCSLP, ACL/NAACL/EACL, EMNLP, SigDial, NCMMSC </li></ul>
<ul><li> ICASSP, Interspeech, IEEE ASRU, IEEE SLT, APSIPA, ISCSLP, ACL/NAACL/EACL, EMNLP, SigDial, NCMMSC </li></ul>
<li> <b> Proposal and Award </b> </li>
<ul>
<li> EPSRC, U.K. </li>
<li> Science and Engineering Research Council, Agency for Science and Technology Research, Singapore </li>
<li> Israel Science Foundation, Israel </li>
<li> Israel Science Foundation (ISF), Israel </li>
<li> Research Grants Council (RGC) of Hong Kong </li>
<li> National Natural Science Foundation of China </li>
<li> Ministry of Science and Technology of China </li>
Expand All @@ -310,7 +310,7 @@ <h4> Best Paper Award </h4>
<li> EURASIP Speech Communication Best Paper Award </li>
<li> International Symposium on Chinese Spoken Language Processing Best Paper Award </li>
<li> ISCA Computer Speech and Language Best Paper Award </li>
<li> InterSpeech Best Paper Award </li>
<li> Interspeech Best Paper Award </li>
<li> IEEE SLT Best Paper Award </li>
<li> NCMMSC Best Paper Award </li>
</ul>
Expand Down
16 changes: 8 additions & 8 deletions index_zh.html
Original file line number Diff line number Diff line change
Expand Up @@ -69,12 +69,12 @@
<hr>

<h3>个人介绍</h3>
<!--&nbsp;&nbsp;&nbsp;&nbsp; 俞凯,现任上海交通大学计算机科学与工程系特聘教授、博导,思必驰公司联合创始人、首席科学家。国家“万人计划”科技创新领军人才,曾获国家自然科学基金委青年优青、上海市“东方学者”特聘教授。清华大学自动化系本科、硕士,英国剑桥大学工程系博士。长期从事人工智能领域的智能语音及语言处理、人机交互、模式识别及机器学习的研究和产业化工作。在语音识别及合成、自然语言理解 、口语对话系统、认知型人机交互等方面取得了一系列国际先进的研究、工程和产业化成果。在国际一流会议和期刊发表论文200余篇,获得Computer Speech and Language,Speech Communication 等顶级期刊最优论文奖和InterSpeech等多个顶级国际会议优秀论文奖,在语音识别、对话系统等一系列国际评测中获得冠军。他是国际电子电气工程师协会(IEEE)高级会员,中国大陆高校首个IEEE Speech and Language Processing Technical Committee 委员(2017-2019),IEEE Transactions on Audio Speech and Language Processing 副主编,曾任InterSpeech等国际会议程序委员会主席,ACL、EMNLP等国际会议研究领域主席。-->
<!--&nbsp;&nbsp;&nbsp;&nbsp; 俞凯,现任上海交通大学计算机科学与工程系特聘教授、博导,思必驰公司联合创始人、首席科学家。国家“万人计划”科技创新领军人才,曾获国家自然科学基金委青年优青、上海市“东方学者”特聘教授。清华大学自动化系本科、硕士,英国剑桥大学工程系博士。长期从事人工智能领域的智能语音及语言处理、人机交互、模式识别及机器学习的研究和产业化工作。在语音识别及合成、自然语言理解 、口语对话系统、认知型人机交互等方面取得了一系列国际先进的研究、工程和产业化成果。在国际一流会议和期刊发表论文200余篇,获得Computer Speech and Language,Speech Communication 等顶级期刊最优论文奖和Interspeech等多个顶级国际会议优秀论文奖,在语音识别、对话系统等一系列国际评测中获得冠军。他是国际电子电气工程师协会(IEEE)高级会员,中国大陆高校首个IEEE Speech and Language Processing Technical Committee 委员(2017-2019),IEEE Transactions on Audio Speech and Language Processing 副主编,曾任Interspeech等国际会议程序委员会主席,ACL、EMNLP等国际会议研究领域主席。-->
<p>
现任上海交通大学计算机科学与工程系特聘教授、博导,计算机系智能人机交互研究所所长,上海交通大学人工智能研究院语音及语言处理中心主任,思必驰公司联合创始人及首席科学家。国家高层次人才项目获得者,科技部中青年科技创新领军人才,国家自然科学基金委优青,上海市“东方学者”特聘教授。清华大学自动化系本科(1999)、硕士(2002),英国剑桥大学工程系博士(2006)。2012年回国在上海交通大学创立智能语音实验室(SpeechLab),后扩展并更名为跨媒体语言智能实验室(X-LANCE)。
</p>
<p>
研究兴趣主要集中在人工智能领域,尤其是以对话为核心的智能语音及自然语言处理,涵盖了语音信号处理、语音识别及合成、音频分析、语言理解、对话管理、语言基础模型、多模态语音及语言处理等方面。研究目标是构建认知型对话智能体,它可以在复杂的现实环境中运行,处理不确定性,以人性化的方式传递信息并通过与环境交互而不断进化。已在国际一流的会议和期刊上发表了200余篇论文,并获得了包括Computer Speech and Language、Speech Communication等顶级期刊的最优论文奖,InterSpeech等多个顶级国际会议的优秀论文奖,以及一系列国际研究评测的冠军。现任IEEE高级会员,作为中国大陆高校首位入选者,曾任 IEEE Speech and Language Processing Technical Committee 委员(2017-2019)。曾任InterSpeech、ICMI、SigDial等国际会议的程序委员会主席,全国人机语音通讯会议大会主席,以及ACL、EMNLP等国际会议的研究领域主席。现任中国计算机学会(CCF)杰出会员,CCF语音对话及听觉专委会主任,中文信息学会(CIPSC)第九届理事会理事,语音信息处理专委会副主任。
研究兴趣主要集中在人工智能领域,尤其是以对话为核心的智能语音及自然语言处理,涵盖了语音信号处理、语音识别及合成、音频分析、语言理解、对话管理、语言基础模型、多模态语音及语言处理等方面。研究目标是构建认知型对话智能体,它可以在复杂的现实环境中运行,处理不确定性,以人性化的方式传递信息并通过与环境交互而不断进化。已在国际一流的会议和期刊上发表了200余篇论文,并获得了包括Computer Speech and Language、Speech Communication等顶级期刊的最优论文奖,Interspeech等多个顶级国际会议的优秀论文奖,以及一系列国际研究评测的冠军。现任IEEE高级会员,作为中国大陆高校首位入选者,曾任 IEEE Speech and Language Processing Technical Committee 委员(2017-2019)。曾任Interspeech、ICMI、SigDial等国际会议的程序委员会主席,全国人机语音通讯会议大会主席,以及ACL、EMNLP等国际会议的研究领域主席。现任中国计算机学会(CCF)杰出会员,CCF语音对话及听觉专委会主任,中文信息学会(CIPSC)第九届理事会理事,语音信息处理专委会副主任。
</p>
<p>
相关研究成果不仅在学术界得到了认可,也成功实现了大规模产业化。作为联合创始人创立“思必驰信息科技有限公司”,任首席科学家,进行智能口语对话交互技术的产业化。思必驰公司因在人工智能技术和产业化方面的领先性,2016年作为中国仅有的两家人工智能创业公司之一,入选高盛发布的全球人工智能报告中的“Key AI Players”;2017年作为中国仅有的三家人工智能公司之一,入选国际权威IT咨询机构Gartner发布的“Cool Vendors for AI (East Asia)”列表。2022年,思必驰被科技部授予“语言计算国家新一代人工智能开放创新平台”,成为国家级的人工智能战略力量。
Expand All @@ -88,7 +88,7 @@ <h3> 上海交通大学跨媒体语言智能实验室 </h3>

<h4>研究兴趣</h4>
<ul>
<li> <i> 语音信息处理</i> 神经语音信号处理,鲁棒语音及声纹识别,高逼真度语音合成,丰富音频分析及听觉认知,多模态语音处理及通用语音大模型 </li>
<li> <i> 语音及音频信息处理</i> 神经语音信号处理,鲁棒语音及声纹识别,高逼真度语音合成,丰富音频分析及听觉认知,多模态语音处理及通用语音大模型 </li>
<li> <i> 自然语言处理:</i> 意图及结构化语言理解,知识问答及阅读理解,统计对话系统,多语种语言处理,语言基础大模型,大模型智能体系统 </li>
<li> <i> 多模态交互:</i> 可控数字人,图形界面理解及交互,科学通用智能体 </li>
</ul>
Expand All @@ -111,7 +111,7 @@ <h3> 论文摘选 <a class="grey" href="https://scholar.google.com/citations?use

<!-- </td></tr></table> -->

<h4>语音信息处理</h4>
<h4>语音及音频信息处理</h4>
<ul>
<li>
<p><span class="tag blue-tag">ASR</span> <b>TDT-KWS: Fast and Accurate Keyword Spotting Using Token-and-duration Transducer</b><br/>
Expand Down Expand Up @@ -254,7 +254,7 @@ <h4> 学术会议服务 </h4>
<ul>
<li> <b>ICASSP</b> </li>
<ul><li> IEEE 语音语言处理技术委员会委员 </li></ul>
<li> <b>InterSpeech</b> </li>
<li> <b>Interspeech</b> </li>
<ul><li> 程序委员会主席,研究领域主席(语音识别/对话系统) </li></ul>
<li> <b>EUSIPCO</b></li>
<ul><li> 研究领域主席(语音处理) </li></ul>
Expand Down Expand Up @@ -286,12 +286,12 @@ <h4> 审稿人 </h4>
<li> 自动化学报 </li>
</ul>
<li> <b> 会议 </b> </li>
<ul><li> ICASSP, InterSpeech, IEEE ASRU, IEEE SLT, APSIPA, ISCSLP, ACL/NAACL/EACL, EMNLP, SigDial, 全国人机语音通讯会议(NCMMSC) </li></ul>
<ul><li> ICASSP, Interspeech, IEEE ASRU, IEEE SLT, APSIPA, ISCSLP, ACL/NAACL/EACL, EMNLP, SigDial, 全国人机语音通讯会议(NCMMSC) </li></ul>
<li> <b> 项目及奖项 </b> </li>
<ul>
<li> 国家自然科学基金委、科技部、工信部、教育部、中科院 </li>
<li> 香港研究资助局(RGC) </li>
<li> 以色列科学基金会 </li>
<li> 以色列科学基金会(ISF) </li>
<li> 英国工程及物理科学研究理事会(EPSRC) </li>
<li> 新加坡科学及技术研发局下属科学及工程研究理事会(Science and Engineering Research Council, Agency for Science and Technology Research) </li>
</ul>
Expand All @@ -305,7 +305,7 @@ <h4> 最优论文奖 </h4>
<li> EURASIP Speech Communication 最优期刊论文奖 </li>
<li> International Symposium on Chinese Spoken Language Processing 最优会议论文奖 </li>
<li> ISCA Computer Speech and Language 最优期刊论文奖 </li>
<li> InterSpeech 最优会议论文奖 </li>
<li> Interspeech 最优会议论文奖 </li>
<li> IEEE SLT 最优会议论文奖 </li>
<li> NCMMSC Best 最优会议论文奖 </li>
</ul>
Expand Down

0 comments on commit 3d6e7f9

Please sign in to comment.