Abstrakti
Voice-controlled IoT nodes and wearable devices require integrated real-time ultra-low-power audio classification circuits to perform tasks such as Keyword Spotting (KWS) and Spoken Language Understanding (SLU). In conventional ADC+DSP implementations [1]-[2], the analog front-end (AFE) and digital feature extractor (FEx) together accounted for >50% of the system power. Analog FEx [3]-[9] reduces power by direct analog-to-feature conversion. Voltage-domain FEx [3]-[6] achieved <0.5μW power but only demonstrated <6 classes KWS. Time-domain FEx [7]-[9] achieved 86%-to-91.5% KWS accuracy with 10-to-12 classes but needed amplitude-normalized input or a costly off-chip classifier. In addition, prior designs [1]-[10] were limited to single-word audio inputs and did not consider continuous speech inputs required by SLU. Real-world operation also requires >60dB input range to cope with the variation of speech volume [11] and the speaker distance from the microphone.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | 2025 IEEE International Solid-State Circuits Conference, ISSCC 2025 |
Kustantaja | IEEE |
Sivut | 238-240 |
Sivumäärä | 3 |
ISBN (elektroninen) | 979-8-3315-4101-9 |
DOI - pysyväislinkit | |
Tila | Julkaistu - 2025 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | IEEE International Solid-State Circuits Conference - San Francisco, Yhdysvallat Kesto: 16 helmik. 2025 → 20 helmik. 2025 |
Conference
Conference | IEEE International Solid-State Circuits Conference |
---|---|
Maa/Alue | Yhdysvallat |
Kaupunki | San Francisco |
Ajanjakso | 16/02/2025 → 20/02/2025 |
Sormenjälki
Sukella tutkimusaiheisiin 'An 8.62μW 75dB-DRSoCEnd-to-End Spoken-Language-Understanding SoC with Channel-Level AGC and Temporal-Sparsity-Aware Streaming-Mode RNN'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.Laitteet
-
Aalto Electronics-ICT
Ryynänen, J. (Manager)
Elektroniikan ja nanotekniikan laitosLaitteistot/tilat: Facility