Opale Technical Blog
Deep insights, practical solutions, and expert analysis for tech enthusiasts.

POLQA – Notes on Odd Behaviours (Part 4)

Author icon Olivier Willi   Calendar icon Thursday, 23rd June 2022

We added POLQA (ITU-T P.863) to our MultiDSLA product some seven months ago. Since then, we have been busy discovering what works and what doesn’t work quite so well in the new algorithm. Most of this learning comes from working with our early adopters of POLQA. We continue to gain further knowledge about using POLQA and how to understand results that don’t make sense. We thought it would be useful to share these odd behaviours over the coming weeks. Here is the third one.

 

4. Understand how POLQA time aligns signals before analysing surface views

POLQA and PESQ align reference and degraded signals in time using rather different philosophies. PESQ evaluates the degraded signal differences compared to the reference signal and moves parts of the degraded signal around to match the reference signal. The PESQ 2D surface view shows the degraded signal time aligned to the reference signal.

 

POLQA’s time alignment algorithm searches for sections of the reference file in the degraded file. This results in the parts of reference signal being moved to align with sections in the degraded signal. Consequently the POLQA 2D surface view shows the reference signal time aligned to the degraded signal. This can lead to the reference surface appearing to be significantly different from what was played into the network or device under test. The example below highlights the difference from an unusual measurement.

 

It would appear that there is a one second gap in the first utterance in the reference file according to POLQA. However, the reference surface looks normal in the PESQ view. In fact, the network delayed the last part of the first utterance by almost one second as shown in the waveform view below. The waveform view shows the reference file that has been generated by MultiDSLA for POLQA and the degraded file that has been captured by the receiving node. Neither has been altered before presentation in the waveform view.

 

Part 1: Don’t use more than 10s of speech in narrowband mode

Part 2: Ensure reference files pass the transparency test

Part 3: Expect reporting of small delay variations which are not actually there

Contact Opale Systems or your distributor for more information.

Open cookie management panel
Close panel
This site uses cookies to ensure its proper functioning. It also uses cookies from third party services to provide advanced functionality. At any time, you can choose which services you wish to activate or decide to withdraw your consent.
 
Customise accepted services
You are free to choose which services you wish to enable. By authorising these third party services, you agree to the deposit and reading of cookies and the use of tracking technologies necessary for their proper functioning. By withdrawing your consent for some of these services, some website features may no longer function.
Website navigation  Read more
The site writes a session cookie to enable it to function properly and to help with navigation. It cannot be deactivated.
Usage: 1 cookie, records the session identifier.
Time to live: The cookie is present during the entire session on the site. It becomes obsolete after 24 minutes of inactivity.
Mandatory
Media Popup
Display videos from Youtube or Dailymotion.
Google Analytics  Read more
Records website statistics.
 
Accept all Refuse all Manage
Follow us on LinkedIn Follow us on Youtube