REVAI-4324: Multichannel transcript grouping #119

dmtrrk · 2024-11-26T23:33:10Z

Overview

Add new optional parameters to get_transcript_xx functions that are available for multichannel media files:

group_channels_by
Specifies how to group multiple channels in the transcript. This parameter determines the atomic entity for breaking down the transcript into monologues. Only applicable when the submitted media has multiple channels (speaker_channels_count > 1):

speaker - groups by speakers
word - groups by individual words
sentence - groups by complete sentences

group_channels_threshold_ms
Threshold in milliseconds for handling speaker interruptions. When a speaker interrupts another speaker, this parameter determines how to group the segments. Only applicable when the submitted media has multiple channels (speaker_channels_count > 1):

If the interruption occurs within this threshold, preference is given to the most recent speaker
If the interruption occurs after this threshold, a new segment is created

Usage

from rev_ai import apiclient, GroupChannelsType

client = apiclient.RevAiAPIClient(token)
job = client.submit_job_local_file(filePath, speaker_channels_count=2)

# default (word, 1000ms)
transcript = client.get_transcript_text(job.id)

# specific WORD params
transcript_word = client.get_transcript_text(job.id, group_channels_by=GroupChannelsType.WORD, group_channels_threshold_ms=1000)

# specific Sentence parameters
transcript_sentence = client.get_transcript_text(job.id, group_channels_by=GroupChannelsType.SENTENCE, group_channels_threshold_ms=2000)

# Speaker parameter
transcript_speaker = client.get_transcript_text(job.id, group_channels_by=GroupChannelsType.SPEAKER)

alexsku · 2024-11-27T19:51:41Z

src/rev_ai/apiclient.py

@@ -337,95 +337,144 @@ def get_list_of_jobs(self, limit=None, starting_after=None):

        return [Job.from_json(job) for job in response.json()]

-    def get_transcript_text(self, id_):
+    def get_transcript_text(self, id_, group_channels_by=None, group_channels_threshold_ms=None):


i would use type hints if possible

I don't see we use type hints in this code. I consider this to be python 2.x compatibility

alexsku · 2024-11-27T19:52:51Z

src/rev_ai/apiclient.py

+        if group_channels_by is not None:
+            params.append('group_channels_by={}'.format(group_channels_by))
+        if group_channels_threshold_ms is not None:
+            params.append('group_channels_threshold_ms={}'.format(group_channels_threshold_ms))


@dmtrrk i think you were saying you have doubts about this, i thiunk this is right, we are dealing with these two parameters independently

alexsku

looks good to me

add parameters

020da79

dmtrrk changed the title ~~REVAI-4324:~~ REVAI-4324: Multichannel transcript grouping Nov 26, 2024

dmtrrk requested review from alexsku and eugenep-rev November 26, 2024 23:40

dmtrrk added 13 commits November 26, 2024 18:43

cleanup

f49454d

cleanup

3894377

cleanup

0e0f242

cleanup

4ba95f4

cleanup

7609478

add tests

bfc7424

add tests

95245a2

bump version

8307b3d

add type

041a3f3

fix exports

0064d86

fix exports

f7bb640

fix exports

285a797

fix exports

d39ea12

dmtrrk marked this pull request as ready for review November 27, 2024 19:15

dmtrrk requested a review from a team as a code owner November 27, 2024 19:15

alexsku reviewed Nov 27, 2024

View reviewed changes

alexsku approved these changes Nov 27, 2024

View reviewed changes

dmtrrk added 4 commits November 27, 2024 15:05

update docs

ee356bd

update docs

752bc94

update docs

1455ee3

update docs

5ca4836

dmtrrk merged commit e36130e into develop Nov 27, 2024
8 checks passed

dmtrrk deleted the feature/REVAI-4324 branch November 27, 2024 20:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REVAI-4324: Multichannel transcript grouping #119

REVAI-4324: Multichannel transcript grouping #119

dmtrrk commented Nov 26, 2024 •

edited

Loading

alexsku Nov 27, 2024

dmtrrk Nov 27, 2024

alexsku Nov 27, 2024

alexsku left a comment

REVAI-4324: Multichannel transcript grouping #119

REVAI-4324: Multichannel transcript grouping #119

Conversation

dmtrrk commented Nov 26, 2024 • edited Loading

Overview

Usage

alexsku Nov 27, 2024

Choose a reason for hiding this comment

dmtrrk Nov 27, 2024

Choose a reason for hiding this comment

alexsku Nov 27, 2024

Choose a reason for hiding this comment

alexsku left a comment

Choose a reason for hiding this comment

dmtrrk commented Nov 26, 2024 •

edited

Loading