{"id":"https://openalex.org/W1988195642","doi":"https://doi.org/10.1002/scj.4690270207","title":"A speaker\u2010adaptation technique for context\u2010dependent models represented by hidden markov networks","display_name":"A speaker\u2010adaptation technique for context\u2010dependent models represented by hidden markov networks","publication_year":1996,"publication_date":"1996-01-01","ids":{"openalex":"https://openalex.org/W1988195642","doi":"https://doi.org/10.1002/scj.4690270207","mag":"1988195642"},"language":"en","primary_location":{"id":"doi:10.1002/scj.4690270207","is_oa":false,"landing_page_url":"https://doi.org/10.1002/scj.4690270207","pdf_url":null,"source":{"id":"https://openalex.org/S58208175","display_name":"Systems and Computers in Japan","issn_l":"0882-1666","issn":["0882-1666","1520-684X"],"is_oa":false,"is_in_doaj":false,"is_core":true,"host_organization":"https://openalex.org/P4310320595","host_organization_name":"Wiley","host_organization_lineage":["https://openalex.org/P4310320595"],"host_organization_lineage_names":["Wiley"],"type":"journal"},"license":null,"license_id":null,"version":"publishedVersion","is_accepted":true,"is_published":true,"raw_source_name":"Systems and Computers in Japan","raw_type":"journal-article"},"type":"article","indexed_in":["crossref"],"open_access":{"is_oa":false,"oa_status":"closed","oa_url":null,"any_repository_has_fulltext":false},"authorships":[{"author_position":"first","author":{"id":"https://openalex.org/A5029205877","display_name":"Jun-ichi Takami","orcid":null},"institutions":[{"id":"https://openalex.org/I4210104143","display_name":"Advanced Telecommunications Research Institute International","ror":"https://ror.org/01pe1d703","country_code":"JP","type":"facility","lineage":["https://openalex.org/I4210104143"]}],"countries":["JP"],"is_corresponding":true,"raw_author_name":"Jun\u2010Ichi Takami","raw_affiliation_strings":["ATR Interpreting Telecommunications Research Laboratories, Kyoto, Japan 619","Jun-ichi Takami graduated in 1984 for the Dept. Acoust. Design, Kyushu Univ. Art. Eng., and affiliatedwith Japan Victor Co. He is engaged at Acoust. Tech. Lab. (at present, Central Tech. Lab.) in research on digital signal processing. Dispatched to ATR Interpret. Telecom. Res. Lab. in 1989 and engaged in research on speech recognition, as a researcher in Speech Inf. Proc. Lab. Returned to Japan Victor Co. Central Res. Lab. In 1994, Recipient of Develop. Award in 1994 from Acoust. Soc. Jap. He is a member of Acoust. Soc. Japan","Presently, with Japan Victor Co"],"affiliations":[{"raw_affiliation_string":"ATR Interpreting Telecommunications Research Laboratories, Kyoto, Japan 619","institution_ids":["https://openalex.org/I4210104143"]},{"raw_affiliation_string":"Jun-ichi Takami graduated in 1984 for the Dept. Acoust. Design, Kyushu Univ. Art. Eng., and affiliatedwith Japan Victor Co. He is engaged at Acoust. Tech. Lab. (at present, Central Tech. Lab.) in research on digital signal processing. Dispatched to ATR Interpret. Telecom. Res. Lab. in 1989 and engaged in research on speech recognition, as a researcher in Speech Inf. Proc. Lab. Returned to Japan Victor Co. Central Res. Lab. In 1994, Recipient of Develop. Award in 1994 from Acoust. Soc. Jap. He is a member of Acoust. Soc. Japan","institution_ids":[]},{"raw_affiliation_string":"Presently, with Japan Victor Co","institution_ids":[]}]},{"author_position":"last","author":{"id":"https://openalex.org/A5003450550","display_name":"Shigeki Sagayama","orcid":null},"institutions":[{"id":"https://openalex.org/I2251713219","display_name":"NTT (Japan)","ror":"https://ror.org/00berct97","country_code":"JP","type":"company","lineage":["https://openalex.org/I2251713219"]}],"countries":["JP"],"is_corresponding":false,"raw_author_name":"Shigeki Sagayama","raw_affiliation_strings":["NTT Human Interface Laboratories, Yokosuka, Japan, 238","Shigeki Sagayama graduated in 1972 from Dept. Phys. Instr., the Univ. of Tokyo, where he received his Master's degree in 1974 and affiliatedwith NIT. He engaged at Musashino Elect. Comm. Lab. and Human Interface Lab. in research on speech information processing. Head of Speech Inf. Proc. Lab. in 1990 in ATR Interpret. Telecomm. Res. Lab. He is engaged at present in research on speech recognition, speech synthesis, and automatic interpreting telephone. Leamesearcher since 1993, NTT Human Interface Lab. Recipient of Invention Award in 1990, Tech. Develop. Award in 1994 from Acoust. Soc. Japan. He a member of Acoust. Soc. Jap.; IEEE; and AVIRG","Shigeki Sagayama graduated in 1972 from Dept. Phys. Instr., the Univ. of Tokyo, where he received his Master's degree in 1974 and affiliatedwith NIT. He engaged at Musashino Elect. Comm. Lab. and Human Interface Lab. in research on speech information processing. Head of Speech Inf. Proc. Lab. in 1990 in ATR Interpret. Telecomm. Res. Lab. He is engaged at present in research on speech recognition, speech synthesis, and automatic interpreting telephone. Leamesearcher since 1993, NTT Human Interface Lab. Recipient of Invention Award in 1990, Tech. Develop. Award in 1994 from Acoust. Soc. Japan. He a member of Acoust. Soc. Jap","AVIRG"],"affiliations":[{"raw_affiliation_string":"NTT Human Interface Laboratories, Yokosuka, Japan, 238","institution_ids":["https://openalex.org/I2251713219"]},{"raw_affiliation_string":"Shigeki Sagayama graduated in 1972 from Dept. Phys. Instr., the Univ. of Tokyo, where he received his Master's degree in 1974 and affiliatedwith NIT. He engaged at Musashino Elect. Comm. Lab. and Human Interface Lab. in research on speech information processing. Head of Speech Inf. Proc. Lab. in 1990 in ATR Interpret. Telecomm. Res. Lab. He is engaged at present in research on speech recognition, speech synthesis, and automatic interpreting telephone. Leamesearcher since 1993, NTT Human Interface Lab. Recipient of Invention Award in 1990, Tech. Develop. Award in 1994 from Acoust. Soc. Japan. He a member of Acoust. Soc. Jap.; IEEE; and AVIRG","institution_ids":[]},{"raw_affiliation_string":"Shigeki Sagayama graduated in 1972 from Dept. Phys. Instr., the Univ. of Tokyo, where he received his Master's degree in 1974 and affiliatedwith NIT. He engaged at Musashino Elect. Comm. Lab. and Human Interface Lab. in research on speech information processing. Head of Speech Inf. Proc. Lab. in 1990 in ATR Interpret. Telecomm. Res. Lab. He is engaged at present in research on speech recognition, speech synthesis, and automatic interpreting telephone. Leamesearcher since 1993, NTT Human Interface Lab. Recipient of Invention Award in 1990, Tech. Develop. Award in 1994 from Acoust. Soc. Japan. He a member of Acoust. Soc. Jap","institution_ids":[]},{"raw_affiliation_string":"AVIRG","institution_ids":[]}]}],"institutions":[],"countries_distinct_count":1,"institutions_distinct_count":2,"corresponding_author_ids":["https://openalex.org/A5029205877"],"corresponding_institution_ids":["https://openalex.org/I4210104143"],"apc_list":null,"apc_paid":null,"fwci":0.0,"has_fulltext":false,"cited_by_count":0,"citation_normalized_percentile":{"value":0.1179402,"is_in_top_1_percent":false,"is_in_top_10_percent":false},"cited_by_percentile_year":null,"biblio":{"volume":"27","issue":"2","first_page":"75","last_page":"86"},"is_retracted":false,"is_paratext":false,"is_xpac":false,"primary_topic":{"id":"https://openalex.org/T10201","display_name":"Speech Recognition and Synthesis","score":0.9965999722480774,"subfield":{"id":"https://openalex.org/subfields/1702","display_name":"Artificial Intelligence"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}},"topics":[{"id":"https://openalex.org/T10201","display_name":"Speech Recognition and Synthesis","score":0.9965999722480774,"subfield":{"id":"https://openalex.org/subfields/1702","display_name":"Artificial Intelligence"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}},{"id":"https://openalex.org/T10860","display_name":"Speech and Audio Processing","score":0.9896000027656555,"subfield":{"id":"https://openalex.org/subfields/1711","display_name":"Signal Processing"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}},{"id":"https://openalex.org/T11309","display_name":"Music and Audio Processing","score":0.9657999873161316,"subfield":{"id":"https://openalex.org/subfields/1711","display_name":"Signal Processing"},"field":{"id":"https://openalex.org/fields/17","display_name":"Computer Science"},"domain":{"id":"https://openalex.org/domains/3","display_name":"Physical Sciences"}}],"keywords":[{"id":"https://openalex.org/keywords/computer-science","display_name":"Computer science","score":0.8045058250427246},{"id":"https://openalex.org/keywords/hidden-markov-model","display_name":"Hidden Markov model","score":0.7567148208618164},{"id":"https://openalex.org/keywords/speech-recognition","display_name":"Speech recognition","score":0.734001874923706},{"id":"https://openalex.org/keywords/adaptation","display_name":"Adaptation (eye)","score":0.6909604072570801},{"id":"https://openalex.org/keywords/context","display_name":"Context (archaeology)","score":0.5114473104476929},{"id":"https://openalex.org/keywords/speaker-recognition","display_name":"Speaker recognition","score":0.4913868010044098},{"id":"https://openalex.org/keywords/smoothing","display_name":"Smoothing","score":0.461504727602005},{"id":"https://openalex.org/keywords/realization","display_name":"Realization (probability)","score":0.46120232343673706},{"id":"https://openalex.org/keywords/supervisor","display_name":"Supervisor","score":0.41407281160354614},{"id":"https://openalex.org/keywords/pattern-recognition","display_name":"Pattern recognition (psychology)","score":0.39913833141326904},{"id":"https://openalex.org/keywords/artificial-intelligence","display_name":"Artificial intelligence","score":0.38430359959602356},{"id":"https://openalex.org/keywords/mathematics","display_name":"Mathematics","score":0.11630034446716309},{"id":"https://openalex.org/keywords/statistics","display_name":"Statistics","score":0.11441200971603394}],"concepts":[{"id":"https://openalex.org/C41008148","wikidata":"https://www.wikidata.org/wiki/Q21198","display_name":"Computer science","level":0,"score":0.8045058250427246},{"id":"https://openalex.org/C23224414","wikidata":"https://www.wikidata.org/wiki/Q176769","display_name":"Hidden Markov model","level":2,"score":0.7567148208618164},{"id":"https://openalex.org/C28490314","wikidata":"https://www.wikidata.org/wiki/Q189436","display_name":"Speech recognition","level":1,"score":0.734001874923706},{"id":"https://openalex.org/C139807058","wikidata":"https://www.wikidata.org/wiki/Q352374","display_name":"Adaptation (eye)","level":2,"score":0.6909604072570801},{"id":"https://openalex.org/C2779343474","wikidata":"https://www.wikidata.org/wiki/Q3109175","display_name":"Context (archaeology)","level":2,"score":0.5114473104476929},{"id":"https://openalex.org/C133892786","wikidata":"https://www.wikidata.org/wiki/Q1145189","display_name":"Speaker recognition","level":2,"score":0.4913868010044098},{"id":"https://openalex.org/C3770464","wikidata":"https://www.wikidata.org/wiki/Q775963","display_name":"Smoothing","level":2,"score":0.461504727602005},{"id":"https://openalex.org/C2781089630","wikidata":"https://www.wikidata.org/wiki/Q21856745","display_name":"Realization (probability)","level":2,"score":0.46120232343673706},{"id":"https://openalex.org/C2779110517","wikidata":"https://www.wikidata.org/wiki/Q1240788","display_name":"Supervisor","level":2,"score":0.41407281160354614},{"id":"https://openalex.org/C153180895","wikidata":"https://www.wikidata.org/wiki/Q7148389","display_name":"Pattern recognition (psychology)","level":2,"score":0.39913833141326904},{"id":"https://openalex.org/C154945302","wikidata":"https://www.wikidata.org/wiki/Q11660","display_name":"Artificial intelligence","level":1,"score":0.38430359959602356},{"id":"https://openalex.org/C33923547","wikidata":"https://www.wikidata.org/wiki/Q395","display_name":"Mathematics","level":0,"score":0.11630034446716309},{"id":"https://openalex.org/C105795698","wikidata":"https://www.wikidata.org/wiki/Q12483","display_name":"Statistics","level":1,"score":0.11441200971603394},{"id":"https://openalex.org/C120665830","wikidata":"https://www.wikidata.org/wiki/Q14620","display_name":"Optics","level":1,"score":0.0},{"id":"https://openalex.org/C199539241","wikidata":"https://www.wikidata.org/wiki/Q7748","display_name":"Law","level":1,"score":0.0},{"id":"https://openalex.org/C86803240","wikidata":"https://www.wikidata.org/wiki/Q420","display_name":"Biology","level":0,"score":0.0},{"id":"https://openalex.org/C31972630","wikidata":"https://www.wikidata.org/wiki/Q844240","display_name":"Computer vision","level":1,"score":0.0},{"id":"https://openalex.org/C121332964","wikidata":"https://www.wikidata.org/wiki/Q413","display_name":"Physics","level":0,"score":0.0},{"id":"https://openalex.org/C17744445","wikidata":"https://www.wikidata.org/wiki/Q36442","display_name":"Political science","level":0,"score":0.0},{"id":"https://openalex.org/C151730666","wikidata":"https://www.wikidata.org/wiki/Q7205","display_name":"Paleontology","level":1,"score":0.0}],"mesh":[],"locations_count":1,"locations":[{"id":"doi:10.1002/scj.4690270207","is_oa":false,"landing_page_url":"https://doi.org/10.1002/scj.4690270207","pdf_url":null,"source":{"id":"https://openalex.org/S58208175","display_name":"Systems and Computers in Japan","issn_l":"0882-1666","issn":["0882-1666","1520-684X"],"is_oa":false,"is_in_doaj":false,"is_core":true,"host_organization":"https://openalex.org/P4310320595","host_organization_name":"Wiley","host_organization_lineage":["https://openalex.org/P4310320595"],"host_organization_lineage_names":["Wiley"],"type":"journal"},"license":null,"license_id":null,"version":"publishedVersion","is_accepted":true,"is_published":true,"raw_source_name":"Systems and Computers in Japan","raw_type":"journal-article"}],"best_oa_location":null,"sustainable_development_goals":[],"awards":[],"funders":[],"has_content":{"grobid_xml":false,"pdf":false},"content_urls":null,"referenced_works_count":12,"referenced_works":["https://openalex.org/W107955645","https://openalex.org/W298176830","https://openalex.org/W1920769845","https://openalex.org/W1965201650","https://openalex.org/W2074126259","https://openalex.org/W2100551412","https://openalex.org/W2110007337","https://openalex.org/W2131513205","https://openalex.org/W2158791054","https://openalex.org/W2165357419","https://openalex.org/W4235245715","https://openalex.org/W6683380142"],"related_works":["https://openalex.org/W4387731985","https://openalex.org/W2755149878","https://openalex.org/W2941808082","https://openalex.org/W2468425257","https://openalex.org/W2528721242","https://openalex.org/W4377009725","https://openalex.org/W4320164562","https://openalex.org/W2356364326","https://openalex.org/W4379086698","https://openalex.org/W2372069567"],"abstract_inverted_index":{"Abstract":[0],"This":[1,21,136],"study":[2],"aims":[3],"at":[4],"the":[5,15,24,72,76,89,103,112,124,127,141,144,151,157,160,163,178,181],"realization":[6],"of":[7,33,40,62,75,99,111,154,159,180],"a":[8,19,30,37,59,80,85,96],"speaker\u2010independent":[9],"speech":[10,41],"recognition":[11,121],"system":[12],"based":[13],"on":[14],"speaker":[16,164,168,182],"adaptation":[17,46,125],"with":[18,150],"supervisor.":[20],"paper":[22,83],"describes":[23],"highly":[25],"accurate":[26],"speaker\u2010adaptation":[27,86],"technique":[28,87],"using":[29,88],"small":[31,38,152],"number":[32,39,61,98,153],"training":[34],"samples.":[35,77],"When":[36],"samples":[42,155],"are":[43],"used":[44],"for":[45,143],"there":[47],"arise":[48],"problems":[49],"that":[50],"sufficient":[51],"information":[52],"cannot":[53],"be":[54],"obtained":[55],"to":[56,71,148,176],"update":[57],"simultaneously":[58,140],"large":[60],"model":[63,100,146],"parameters,":[64],"and":[65,156],"an":[66,117],"estimation":[67,161],"error":[68],"is":[69,109,134,172],"included":[70],"statistical":[73],"bias":[74],"From":[78],"such":[79],"viewpoint,":[81],"this":[82],"proposes":[84],"hidden":[90],"Markov":[91],"network":[92],"(HMnet),":[93],"which":[94,108],"employs":[95],"smaller":[97],"parameters":[101,147],"than":[102],"mixed":[104],"continuous\u2010distributed":[105],"phoneme":[106,113],"HMM,":[107],"independent":[110],"context,":[114],"while":[115],"realizing":[116],"equal":[118],"or":[119],"better":[120],"performance.":[122],"As":[123],"technique,":[126],"moving":[128],"vector":[129],"field":[130],"smoothing":[131],"(VFS)":[132],"method":[133,137,170],"used.":[135],"can":[138],"realize":[139],"interpolation":[142],"unadapted":[145],"cope":[149],"correction":[158],"in":[162,174],"adaptation.":[165,183],"The":[166],"standard":[167],"pre\u2010selection":[169],"also":[171],"investigated":[173],"order":[175],"improve":[177],"accuracy":[179]},"counts_by_year":[],"updated_date":"2025-11-06T03:46:38.306776","created_date":"2025-10-10T00:00:00"}
