Active Speaker Localization
Active Speaker Localization (ASL) is the process of spatially localizing an active speaker (talker) in an environment using either audio, vision or both.
Papers
Showing 1–5 of 5 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AV (cor+eng+box) | ASL mAP | 0.86 | — | Unverified |