Speaker identification WIP #46

etown · 2024-03-01T18:07:50Z

Adds person and voice sample models. Also adds an abstract identification service which we can implement for different embedding models/strategies.

Persons can be enrolled from the CLI, but we will be able to tag them from the UI as well. Actual identification is not implemented yet.
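For readers following along, here is a minimal sketch of what an abstract identification service with swappable embedding strategies could look like. The class and method names (`AbstractSpeakerIdentificationService`, `embed`, `identify`, `StubSpeakerIdentificationService`) are illustrative assumptions, not the interface this PR actually adds.

```python
from abc import ABC, abstractmethod
from typing import List, Optional


class AbstractSpeakerIdentificationService(ABC):
    """Hypothetical interface; one subclass per embedding model/strategy."""

    @abstractmethod
    def embed(self, audio_path: str) -> List[float]:
        """Return a voice embedding for the audio file at audio_path."""

    @abstractmethod
    def identify(self, audio_path: str) -> Optional[int]:
        """Return the id of the matching person, or None if nobody matches."""


class StubSpeakerIdentificationService(AbstractSpeakerIdentificationService):
    """Placeholder that never identifies anyone (identification is not implemented yet)."""

    def embed(self, audio_path: str) -> List[float]:
        # No model is loaded in the stub, so there is nothing to embed.
        return []

    def identify(self, audio_path: str) -> Optional[int]:
        # Always report "unknown speaker".
        return None
```

A concrete strategy would subclass the abstract class and override both methods, so callers only depend on the shared interface.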

trzy · 2024-03-01T19:51:48Z

owl/models/schemas.py

+class VoiceSample(CreatedAtMixin, table=True):
+    id: Optional[int] = Field(default=None, primary_key=True)
+    filepath: str = Field(...)
+    speaker_embeddings: dict = Field(default={}, sa_column=Column(JSON))


If the key is the embedding model, can we name this explicitly, e.g. speaker_embeddings_by_model?
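To illustrate the suggested naming, here is a small standalone sketch in which the dictionary is keyed by embedding model name. It uses a plain dataclass rather than the SQLModel table from the diff, and the model names `"model-a"` and `"model-b"` are made up for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class VoiceSampleSketch:
    """Illustrative stand-in for VoiceSample; not the actual table definition."""

    filepath: str
    # Maps embedding model name -> embedding vector produced by that model.
    speaker_embeddings_by_model: Dict[str, List[float]] = field(default_factory=dict)
    id: Optional[int] = None


sample = VoiceSampleSketch(filepath="samples/alice.wav")
# Hypothetical model names; each model stores its own vector under its own key.
sample.speaker_embeddings_by_model["model-a"] = [0.1, 0.2, 0.3]
sample.speaker_embeddings_by_model["model-b"] = [0.5, 0.4]
```

With this name, a lookup such as `sample.speaker_embeddings_by_model["model-a"]` makes it clear at the call site what the key represents.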

Bakuutin · 2024-04-06T14:11:03Z

alembic/versions/b6aff0a993d7_add_person_and_voicesamples.py

+    # Use batch operations to support SQLite ALTER TABLE for adding constraints
+    with op.batch_alter_table('utterance', schema=None) as batch_op:
+        batch_op.add_column(sa.Column('person_id', sa.Integer(), nullable=True))
+        batch_op.create_foreign_key('fk_utterance_person', 'person', ['person_id'], ['id'])


Would it make sense to store the vector embedding of the voice here? This way, we would be able to:

- Show distinct speakers in the UI even without having the Persons in the DB
- On creating a new person, easily find all the past instances when that person spoke by fetching all the utterances with similar enough voice embeddings
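As a rough sketch of the second point, matching past utterances to a newly created person could be done by comparing stored embeddings with cosine similarity. The function names, the in-memory dictionary of utterance embeddings, and the `0.75` threshold are all assumptions for illustration; nothing here reflects code in this PR.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_matching_utterances(
    reference: Sequence[float],
    utterance_embeddings: Dict[int, Sequence[float]],
    threshold: float = 0.75,  # hypothetical cutoff; would need tuning per model
) -> List[int]:
    """Return ids of utterances whose embedding is similar enough to reference."""
    return [
        utterance_id
        for utterance_id, embedding in utterance_embeddings.items()
        if cosine_similarity(reference, embedding) >= threshold
    ]
```

In practice the comparison would likely run against embeddings stored per utterance in the database rather than an in-memory dictionary, and the threshold would depend on the embedding model.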

etown added 3 commits March 1, 2024 10:04

Add person and voice sample model

0a1acba

Enroll speaker from CLI

8444ed7

Speaker identification service

ff584f6

trzy reviewed Mar 1, 2024

Stub identification service

7467cb8

Bakuutin reviewed Apr 6, 2024
