"Matlab Forever!" (Audio Signal Processing Project)

Pete M on 23 Feb 2021
Commented: Pete M on 2 Mar 2021
Broad Aim: When did someone say something on a TV program?
Viewers of a TV program (e.g. game show contestants answering timed questions) believe they are watching the same content at the same time as other viewers across the country. But due to transmission delays, they are actually watching a given part of the program seconds or minutes apart. This time difference between the original source material (audio reference file) and the viewed material (the user’s audio file) is what I am trying to determine using Matlab.
The Application: A user points their phone at a TV in a normal living room, where there will be extraneous noise in addition to the program audio. The user records a sample of a few seconds at the beginning of the program. This captured sound sample is transmitted back to the server, where it is compared against a reference audio file of the same content. The aim is to determine exactly where the viewer is in the TV program timeline.
Example
Stage 1: Each user is invited to grab a sound sample of a few seconds of the program material at the very beginning of the program. This is relayed back to the server as the user's initial program time.
Stage 2: At 5 minutes, 32 seconds from the beginning of the program (an arbitrary start point) the actor raises his hand and declares "Matlab Forever!".
User 1: Views this scene at 00:05:34 and acknowledges this cue by tapping a button on his phone UI.
User 2: Views this scene at 00:05:36, ditto
User 3: Views this scene at 00:05:42, ditto
The system now knows that users 1, 2, and 3 have delays of +2, +4, and +10 seconds respectively from when the material was broadcast. Mission accomplished.
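As a rough illustration of the arithmetic in this example, here is a minimal sketch in Python (not MATLAB). The function names and the dictionary layout are hypothetical, and it assumes each tap is already expressed on the same clock as the broadcast cue at 00:05:32. It ignores the viewer's reaction time.

```python
def to_seconds(timestamp):
    """Convert an 'HH:MM:SS' string to a whole number of seconds."""
    hours, minutes, seconds = (int(part) for part in timestamp.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def viewer_delays(broadcast_time, tap_times):
    """Return each viewer's delay in seconds relative to the broadcast cue."""
    reference = to_seconds(broadcast_time)
    return {user: to_seconds(t) - reference for user, t in tap_times.items()}


# Values taken from the example above; the names are illustrative only.
taps = {"User 1": "00:05:34", "User 2": "00:05:36", "User 3": "00:05:42"}
delays = viewer_delays("00:05:32", taps)
print(delays)  # {'User 1': 2, 'User 2': 4, 'User 3': 10}
```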
Advice Sought
1) What is the level of difficulty of this project for someone who is not an engineer and has no mathematical background or programming experience? I have never used Matlab before, but will download a trial shortly. That said, I do have some experience with audio, music, and sound processing.
2) It's hard for me to gauge the feasibility of pulling this off. No doubt the community can help with the occasional problem, which is great, but I imagine there will be parts of the project where I need direct help. I am happy to pay for a few hours here and there, as that would seem fair. (I'm not sure if the community rules allow this.) Is anyone interested on that basis?
3) Can this project be done with these three Matlab tools? Signal Processing Toolbox, Signal Analyzer, Audio Toolbox. Perhaps Simulink too?
4) To kickstart this, any methodology tips would be greatly appreciated.
Thanks!
5 Comments
Pete M on 2 Mar 2021
Yes @Walter Roberson. I have studied articles on 'audio fingerprinting', which explain how Shazam is implemented, and that looks to be a smart approach to recognizing the audio segment for comparison against the reference file.
You suggest "cross-correlation" to identify the lags, which is critical to making it work, so that is a good pointer. In theory I understand what you mean here, but I have no practical experience doing these things. Anyway, it is good enough for now to map out a methodology. Thank you.
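To make the cross-correlation idea concrete, here is a minimal sketch in plain Python (not MATLAB) that slides a short noisy recording along a longer reference and picks the offset with the strongest match. Everything in it is an assumption for illustration: the signals are synthetic random noise rather than real TV audio, the sample rate is a toy value, and `best_lag` is a hypothetical helper. In MATLAB, the Signal Processing Toolbox function `xcorr` returns correlation values together with their lags and would likely be the starting point. Real recordings would probably also need resampling and filtering first.

```python
import math
import random


def best_lag(reference, sample):
    """Return the offset (in samples) at which `sample` best matches `reference`.

    Brute-force sliding cross-correlation: for each candidate offset, take the
    dot product of the sample with the equally long window of the reference,
    divided by the window's magnitude so loud passages do not dominate.
    """
    n = len(sample)
    best_offset, best_score = 0, float("-inf")
    for offset in range(len(reference) - n + 1):
        window = reference[offset:offset + n]
        dot = sum(r * s for r, s in zip(window, sample))
        magnitude = math.sqrt(sum(r * r for r in window)) or 1.0
        score = dot / magnitude
        if score > best_score:
            best_offset, best_score = offset, score
    return best_offset


# Synthetic stand-in for real audio (assumption): 2 s of random "program"
# sound at a toy sample rate of 200 Hz.
random.seed(0)
sample_rate = 200
reference = [random.uniform(-1.0, 1.0) for _ in range(2 * sample_rate)]

# The "phone recording": 0.5 s of the reference starting at 0.75 s,
# with Gaussian noise added to mimic living-room sounds.
true_offset = int(0.75 * sample_rate)
recording = [x + random.gauss(0.0, 0.3)
             for x in reference[true_offset:true_offset + sample_rate // 2]]

offset = best_lag(reference, recording)
print(offset / sample_rate)  # estimated position in the reference, in seconds
```

The brute-force loop is only practical for short signals. For full-length program audio, an FFT-based correlation or the fingerprinting approach mentioned above would scale better.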
Pete M on 2 Mar 2021
@Steven Lord, quite frankly I am looking at a very high-level approach to start with. Although I have no direct experience with MATLAB, I know I would need a programmed solution and believe that MATLAB would be suitable. Even better if I can find someone to help with the project, with fees paid for the work. Thanks.


Answers (1)

Pete M on 28 Feb 2021
No responses. Perhaps this question is too basic? Or too broad?
