{"id":"https://openalex.org/W2146507015","doi":"https://doi.org/10.1002/scj.4690271405","title":"Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction","display_name":"Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction","publication_year":1996,"publication_date":"1996-01-01","ids":{"openalex":"https://openalex.org/W2146507015","doi":"https://doi.org/10.1002/scj.4690271405","mag":"2146507015"},"language":"en","primary_location":{"id":"doi:10.1002/scj.4690271405","is_oa":false,"landing_page_url":"https://doi.org/10.1002/scj.4690271405","pdf_url":null,"source":{"id":"https://openalex.org/S58208175","display_name":"Systems and Computers in Japan","issn_l":"0882-1666","issn":["0882-1666","1520-684X"],"is_oa":false,"is_in_doaj":false,"is_core":true,"host_organization":"https://openalex.org/P4310320595","host_organization_name":"Wiley","host_organization_lineage":["https://openalex.org/P4310320595"],"host_organization_lineage_names":["Wiley"],"type":"journal"},"license":null,"license_id":null,"version":"publishedVersion","is_accepted":true,"is_published":true,"raw_source_name":"Systems and Computers in Japan","raw_type":"journal-article"},"type":"article","indexed_in":["crossref"],"open_access":{"is_oa":false,"oa_status":"closed","oa_url":null,"any_repository_has_fulltext":false},"authorships":[{"author_position":"first","author":{"id":"https://openalex.org/A5076088175","display_name":"Ryuji Mine","orcid":"https://orcid.org/0000-0002-0130-6752"},"institutions":[{"id":"https://openalex.org/I65143321","display_name":"Hitachi (Japan)","ror":"https://ror.org/02exqgm79","country_code":"JP","type":"company","lineage":["https://openalex.org/I65143321"]},{"id":"https://openalex.org/I150744194","display_name":"Waseda University","ror":"https://ror.org/00ntfnx83","country_code":"JP","type":"education","lineage":["https://openalex.org/I150744194"]}],"countries":["JP"],"is_corresponding":true,"raw_author_name":"Ryuji Mine","raw_affiliation_strings":["Currently with Hitachi Ltd","Ryuji Mine:  graduated in 1992 from the Dept. Electrical Eng., Waseda Univ., where he received his Master's degree in 1995 and affiliatedwith Hitachi Ltd. As a graduate student, he engaged in research on speech recognition. He is a member of the Acoust. Soc. Japan","School of Science and Engineering, Waseda University, Tokyo, Japan 169"],"affiliations":[{"raw_affiliation_string":"Currently with Hitachi Ltd","institution_ids":["https://openalex.org/I65143321"]},{"raw_affiliation_string":"Ryuji Mine:  graduated in 1992 from the Dept. Electrical Eng., Waseda Univ., where he received his Master's degree in 1995 and affiliatedwith Hitachi Ltd. As a graduate student, he engaged in research on speech recognition. He is a member of the Acoust. Soc. Japan","institution_ids":[]},{"raw_affiliation_string":"School of Science and Engineering, Waseda University, Tokyo, Japan 169","institution_ids":["https://openalex.org/I150744194"]}]},{"author_position":"middle","author":{"id":"https://openalex.org/A5101188700","display_name":"Tetsunori Kobayashi","orcid":null},"institutions":[{"id":"https://openalex.org/I1343180700","display_name":"Intel (United States)","ror":"https://ror.org/01ek73717","country_code":"US","type":"company","lineage":["https://openalex.org/I1343180700"]},{"id":"https://openalex.org/I150744194","display_name":"Waseda University","ror":"https://ror.org/00ntfnx83","country_code":"JP","type":"education","lineage":["https://openalex.org/I150744194"]}],"countries":["JP","US"],"is_corresponding":false,"raw_author_name":"Tetsunori Kobayashi","raw_affiliation_strings":["School of Science and Engineering, Waseda University, Tokyo, Japan 169","Tetsunori Kobayashi:  graduated in 1980 from the Dept. Electrical Eng., Waseda Univ., where he received his Dr. of Eng. degree in 1985. He was a Lecturer in 1985 and Assoc. Prof. in 1987 in the Dept. Electrical Eng., Hosei Univ., and Assoc. Prof. in 1991 in the Dept. Electrical Eng., Waseda Univ. He is engaged in research on speech information processing. He is a member of Acoust. Soc. Jap.; Soc. Artif. Intel.; Inf. Proc. Soc.; Robot. Soc. Jap.; and IEEE","Robot. Soc. Jap","IEEE","Inf. Proc. Soc","Tetsunori Kobayashi:  graduated in 1980 from the Dept. Electrical Eng., Waseda Univ., where he received his Dr. of Eng. degree in 1985. He was a Lecturer in 1985 and Assoc. Prof. in 1987 in the Dept. Electrical Eng., Hosei Univ., and Assoc. Prof. in 1991 in the Dept. Electrical Eng., Waseda Univ. He is engaged in research on speech information processing. He is a member of Acoust. Soc. Jap","Soc. Artif. Intel"],"affiliations":[{"raw_affiliation_string":"School of Science and Engineering, Waseda University, Tokyo, Japan 169","institution_ids":["https://openalex.org/I150744194"]},{"raw_affiliation_string":"Tetsunori Kobayashi:  graduated in 1980 from the Dept. Electrical Eng., Waseda Univ., where he received his Dr. of Eng. degree in 1985. He was a Lecturer in 1985 and Assoc. Prof. in 1987 in the Dept. Electrical Eng., Hosei Univ., and Assoc. Prof. in 1991 in the Dept. Electrical Eng., Waseda Univ. He is engaged in research on speech information processing. He is a member of Acoust. Soc. Jap.; Soc. Artif. Intel.; Inf. Proc. Soc.; Robot. Soc. Jap.; and IEEE","institution_ids":[]},{"raw_affiliation_string":"Robot. Soc. Jap","institution_ids":[]},{"raw_affiliation_string":"IEEE","institution_ids":[]},{"raw_affiliation_string":"Inf. Proc. Soc","institution_ids":[]},{"raw_affiliation_string":"Tetsunori Kobayashi:  graduated in 1980 from the Dept. Electrical Eng., Waseda Univ., where he received his Dr. of Eng. degree in 1985. He was a Lecturer in 1985 and Assoc. Prof. in 1987 in the Dept. Electrical Eng., Hosei Univ., and Assoc. Prof. in 1991 in the Dept. Electrical Eng., Waseda Univ. He is engaged in research on speech information processing. He is a member of Acoust. Soc. Jap","institution_ids":[]},{"raw_affiliation_string":"Soc. Artif. Intel","institution_ids":["https://openalex.org/I1343180700"]}]},{"author_position":"last","author":{"id":"https://openalex.org/A5109058482","display_name":"Katsuhiko Shirai","orcid":null},"institutions":[{"id":"https://openalex.org/I150744194","display_name":"Waseda University","ror":"https://ror.org/00ntfnx83","country_code":"JP","type":"education","lineage":["https://openalex.org/I150744194"]}],"countries":["JP"],"is_corresponding":false,"raw_author_name":"Katsuhiko Shirai","raw_affiliation_strings":["Katsuhiko Shirai:  graduated in 1963 from the Dept. Electrical Eng., Waseda Univ. where he received his Dr. of Eng. degree in 1968. He was a Lecturer in 1968, Assoc. Prof., and since 1975 has been a Prof., Dept. Electrical Eng., Waseda Univ. He is engaged in research on human interface, emphasizing speech recognitiodsynthesis techniques, natural language processing, design of signal processing-oriented architecture and CAI. He is a member of Inf. Proc. Soc.; Acoust. Soc. Jap.; and IEEE","School of Science and Engineering, Waseda University, Tokyo, Japan 169","IEEE","Acoust. Soc. Jap","Katsuhiko Shirai:  graduated in 1963 from the Dept. Electrical Eng., Waseda Univ. where he received his Dr. of Eng. degree in 1968. He was a Lecturer in 1968, Assoc. Prof., and since 1975 has been a Prof., Dept. Electrical Eng., Waseda Univ. He is engaged in research on human interface, emphasizing speech recognitiodsynthesis techniques, natural language processing, design of signal processing-oriented architecture and CAI. He is a member of Inf. Proc. Soc"],"affiliations":[{"raw_affiliation_string":"Katsuhiko Shirai:  graduated in 1963 from the Dept. Electrical Eng., Waseda Univ. where he received his Dr. of Eng. degree in 1968. He was a Lecturer in 1968, Assoc. Prof., and since 1975 has been a Prof., Dept. Electrical Eng., Waseda Univ. He is engaged in research on human interface, emphasizing speech recognitiodsynthesis techniques, natural language processing, design of signal processing-oriented architecture and CAI. He is a member of Inf. Proc. Soc.; Acoust. Soc. Jap.; and IEEE","institution_ids":[]},{"raw_affiliation_string":"School of Science and Engineering, Waseda University, Tokyo, Japan 169","institution_ids":["https://openalex.org/I150744194"]},{"raw_affiliation_string":"IEEE","institution_ids":[]},{"raw_affiliation_string":"Acoust. Soc. Jap","institution_ids":[]},{"raw_affiliation_string":"Katsuhiko Shirai:  graduated in 1963 from the Dept. Electrical Eng., Waseda Univ. where he received his Dr. of Eng. degree in 1968. He was a Lecturer in 1968, Assoc. Prof., and since 1975 has been a Prof., Dept. Electrical Eng., Waseda Univ. He is engaged in research on human interface, emphasizing speech recognitiodsynthesis techniques, natural language processing, design of signal processing-oriented architecture and CAI. He is a member of Inf. Proc. Soc","institution_ids":[]}]}],"institutions":[],"countries_distinct_count":2,"institutions_distinct_count":3,"corresponding_author_ids":["https://openalex.org/A5076088175"],"corresponding_institution_ids":["https://openalex.org/I150744194","https://openalex.org/I65143321"],"apc_list":null,"apc_paid":null,"fwci":0.0,"has_fulltext":false,"cited_by_count":2,"citation_normalized_percentile":{"value":0.29528302,"is_in_top_1_percent":false,"is_in_top_10_percent":false},"cited_by_percentile_year":null,"biblio":{"volume":"27","issue":"14","first_page":"37","last_page":"44"},"is_retracted":false,"is_paratext":false,"is_xpac":false,"primary_topic":{"id":"https://openalex.org/T10860","display_name":"Speech and Audio Processing","score":0.9998999834060669,"subfield":{"id":"https://openalex.org/subfields/1711","display_name":"Signal Processing"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}},"topics":[{"id":"https://openalex.org/T10860","display_name":"Speech and Audio Processing","score":0.9998999834060669,"subfield":{"id":"https://openalex.org/subfields/1711","display_name":"Signal Processing"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}},{"id":"https://openalex.org/T10201","display_name":"Speech Recognition and Synthesis","score":0.9994000196456909,"subfield":{"id":"https://openalex.org/subfields/1702","display_name":"Artificial Intelligence"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}},{"id":"https://openalex.org/T11309","display_name":"Music and Audio Processing","score":0.9959999918937683,"subfield":{"id":"https://openalex.org/subfields/1711","display_name":"Signal Processing"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}}],"keywords":[{"id":"https://openalex.org/keywords/subtraction","display_name":"Subtraction","score":0.7581517696380615},{"id":"https://openalex.org/keywords/computer-science","display_name":"Computer science","score":0.7380833625793457},{"id":"https://openalex.org/keywords/speech-recognition","display_name":"Speech recognition","score":0.7212217450141907},{"id":"https://openalex.org/keywords/noise","display_name":"Noise (video)","score":0.6700016856193542},{"id":"https://openalex.org/keywords/hidden-markov-model","display_name":"Hidden Markov model","score":0.6163789629936218},{"id":"https://openalex.org/keywords/set","display_name":"Set (abstract data type)","score":0.5371637344360352},{"id":"https://openalex.org/keywords/pattern-recognition","display_name":"Pattern recognition (psychology)","score":0.5259031653404236},{"id":"https://openalex.org/keywords/word","display_name":"Word (group theory)","score":0.5177316069602966},{"id":"https://openalex.org/keywords/task","display_name":"Task (project management)","score":0.43043118715286255},{"id":"https://openalex.org/keywords/artificial-intelligence","display_name":"Artificial intelligence","score":0.4026815891265869},{"id":"https://openalex.org/keywords/mathematics","display_name":"Mathematics","score":0.19807752966880798},{"id":"https://openalex.org/keywords/arithmetic","display_name":"Arithmetic","score":0.0932857096195221}],"concepts":[{"id":"https://openalex.org/C68060419","wikidata":"https://www.wikidata.org/wiki/Q40754","display_name":"Subtraction","level":2,"score":0.7581517696380615},{"id":"https://openalex.org/C41008148","wikidata":"https://www.wikidata.org/wiki/Q21198","display_name":"Computer science","level":0,"score":0.7380833625793457},{"id":"https://openalex.org/C28490314","wikidata":"https://www.wikidata.org/wiki/Q189436","display_name":"Speech recognition","level":1,"score":0.7212217450141907},{"id":"https://openalex.org/C99498987","wikidata":"https://www.wikidata.org/wiki/Q2210247","display_name":"Noise (video)","level":3,"score":0.6700016856193542},{"id":"https://openalex.org/C23224414","wikidata":"https://www.wikidata.org/wiki/Q176769","display_name":"Hidden Markov model","level":2,"score":0.6163789629936218},{"id":"https://openalex.org/C177264268","wikidata":"https://www.wikidata.org/wiki/Q1514741","display_name":"Set (abstract data type)","level":2,"score":0.5371637344360352},{"id":"https://openalex.org/C153180895","wikidata":"https://www.wikidata.org/wiki/Q7148389","display_name":"Pattern recognition (psychology)","level":2,"score":0.5259031653404236},{"id":"https://openalex.org/C90805587","wikidata":"https://www.wikidata.org/wiki/Q10944557","display_name":"Word (group theory)","level":2,"score":0.5177316069602966},{"id":"https://openalex.org/C2780451532","wikidata":"https://www.wikidata.org/wiki/Q759676","display_name":"Task (project management)","level":2,"score":0.43043118715286255},{"id":"https://openalex.org/C154945302","wikidata":"https://www.wikidata.org/wiki/Q11660","display_name":"Artificial intelligence","level":1,"score":0.4026815891265869},{"id":"https://openalex.org/C33923547","wikidata":"https://www.wikidata.org/wiki/Q395","display_name":"Mathematics","level":0,"score":0.19807752966880798},{"id":"https://openalex.org/C94375191","wikidata":"https://www.wikidata.org/wiki/Q11205","display_name":"Arithmetic","level":1,"score":0.0932857096195221},{"id":"https://openalex.org/C187736073","wikidata":"https://www.wikidata.org/wiki/Q2920921","display_name":"Management","level":1,"score":0.0},{"id":"https://openalex.org/C162324750","wikidata":"https://www.wikidata.org/wiki/Q8134","display_name":"Economics","level":0,"score":0.0},{"id":"https://openalex.org/C199360897","wikidata":"https://www.wikidata.org/wiki/Q9143","display_name":"Programming language","level":1,"score":0.0},{"id":"https://openalex.org/C2524010","wikidata":"https://www.wikidata.org/wiki/Q8087","display_name":"Geometry","level":1,"score":0.0},{"id":"https://openalex.org/C115961682","wikidata":"https://www.wikidata.org/wiki/Q860623","display_name":"Image (mathematics)","level":2,"score":0.0}],"mesh":[],"locations_count":1,"locations":[{"id":"doi:10.1002/scj.4690271405","is_oa":false,"landing_page_url":"https://doi.org/10.1002/scj.4690271405","pdf_url":null,"source":{"id":"https://openalex.org/S58208175","display_name":"Systems and Computers in Japan","issn_l":"0882-1666","issn":["0882-1666","1520-684X"],"is_oa":false,"is_in_doaj":false,"is_core":true,"host_organization":"https://openalex.org/P4310320595","host_organization_name":"Wiley","host_organization_lineage":["https://openalex.org/P4310320595"],"host_organization_lineage_names":["Wiley"],"type":"journal"},"license":null,"license_id":null,"version":"publishedVersion","is_accepted":true,"is_published":true,"raw_source_name":"Systems and Computers in Japan","raw_type":"journal-article"}],"best_oa_location":null,"sustainable_development_goals":[{"score":0.7900000214576721,"id":"https://metadata.un.org/sdg/11","display_name":"Sustainable cities and communities"}],"awards":[],"funders":[],"has_content":{"pdf":false,"grobid_xml":false},"content_urls":null,"referenced_works_count":13,"referenced_works":["https://openalex.org/W235518815","https://openalex.org/W1574295218","https://openalex.org/W1800365115","https://openalex.org/W1893859273","https://openalex.org/W1988913686","https://openalex.org/W2108963403","https://openalex.org/W2123264590","https://openalex.org/W2125230399","https://openalex.org/W2128653836","https://openalex.org/W2130322773","https://openalex.org/W2142714559","https://openalex.org/W2150161081","https://openalex.org/W2157702502"],"related_works":["https://openalex.org/W2053269318","https://openalex.org/W2136763963","https://openalex.org/W2109705048","https://openalex.org/W2940588515","https://openalex.org/W1909151225","https://openalex.org/W3184123547","https://openalex.org/W2160030256","https://openalex.org/W4253235840","https://openalex.org/W3151937861","https://openalex.org/W2488941600"],"abstract_inverted_index":{"Abstract":[0],"This":[1],"paper":[2],"proposes":[3],"a":[4,10,26,50],"method":[5,107,119],"of":[6,28,37,52,67,116],"speech":[7,39,61],"recognition":[8,73],"in":[9,74,78,83,92],"nonstationary":[11],"noisy":[12,76],"environment,":[13],"combining":[14],"the":[15,19,23,35,38,41,46,60,63,65,75,90,93,100,103,114,117],"parallel":[16,110],"HMMs":[17,56],"and":[18,40,62,108],"spectral":[20,105],"subtraction.":[21],"In":[22],"proposed":[24,101,118],"method,":[25,102],"set":[27],"hypothesis":[29],"is":[30,87,120],"generated":[31],"with":[32],"respect":[33],"to":[34],"combination":[36],"noise":[42],"that":[43],"can":[44],"produce":[45],"observed":[47],"data":[48],"by":[49],"series":[51],"subtraction":[53,106],"processes.":[54],"Using":[55],"prepared":[57],"separately":[58],"for":[59,99],"noise,":[64],"probabilities":[66],"occurrence":[68],"are":[69,97],"calculated.":[70],"The":[71],"100\u2010word":[72],"environment":[77],"an":[79,84],"ordinary":[80,104],"car":[81],"running":[82],"urban":[85],"area,":[86],"defined":[88],"as":[89],"task":[91],"experiment.":[94],"Comparative":[95],"experiments,":[96],"made":[98],"other":[109],"HMM":[111],"methods.":[112],"Then,":[113],"effectiveness":[115],"verified.":[121]},"counts_by_year":[],"updated_date":"2025-11-06T03:46:38.306776","created_date":"2025-10-10T00:00:00"}
