Ai dictation mode #48

C-Loftus · 2024-03-28T18:09:49Z

Use system accessibility APIs to dynamically get the proper context and automatically fix dictation and all punctuation as you speak it

for more information, see https://pre-commit.ci

C-Loftus · 2024-03-28T18:13:06Z

@jaresty Opinions on this? I want to try and make it so that we can pass a lot of context to the model and that we can use Talon for the baseline speech to text and then we can still get the more specific formatting we want on stuff by using the model to fix up things like proper nouns and/or punctuation.

C-Loftus · 2024-03-28T18:18:31Z

We can also create model select or something similar to select a range in an editable text box by passing all the context to the model having it return the range, so we wouldn't need to highlight it. I think there is a ton of potential with accessibility APIs in general, but unfortunately this does mean some OS or beta/public release talon fragmentation.

jaresty · 2024-03-28T19:13:22Z

I think this is a great idea. One less step to correct dictation!

4b11b4 · 2024-04-28T17:34:10Z

This is a rough idea but is there someway to leverage the work from https://github.com/OpenInterpreter/open-interpreter @C-Loftus

C-Loftus · 2024-04-29T00:26:31Z

This is a rough idea but is there someway to leverage the work from https://github.com/OpenInterpreter/open-interpreter @C-Loftus

Just curious do you have specific features in that repo you are looking for? @4b11b4 I am somewhat familiar with that, but not the specifics. This repo should have many of the same features but for voice. Since Talon packages in general are intended not to use external libraries, I've implemented most stuff from scratch.

For context (either you or anyone viewing this, this PR is sort of blocked at the moment since it relies upon Talon's accessibility bindings which aren't really documented and have dependencies on an underlying Rust library that sometimes doesn't behave as intended. Without being able to use these apis to pass additional surrounding context, real-time AI dictation fixes aren't particularly useful and it is just better to use model fix grammar as it is currently implemented

Let me know if you have other ideas or I am overlooking something you think could help this situation

C-Loftus · 2024-07-19T15:29:41Z

Closing this since it isn't really practical imo. Better to just use copilot or codeium. And axkit handles simpler context aware punctuation well on its own for macos, which would've been a big use case for this.

C-Loftus and others added 10 commits March 18, 2024 13:58

remove 310

e92d242

Merge branch 'main' of https://github.com/C-Loftus/talon-gpt

da4d123

emoji formatting

32a43d2

fix plural

bfc2eb0

begin natural language dictation

d96e1f3

Merge branch 'main' of https://github.com/C-Loftus/talon-gpt

4412b4c

better ai dictation

d2cf402

[pre-commit.ci] auto fixes from pre-commit.com hooks

e41e77c

for more information, see https://pre-commit.ci

Merge branch 'main' into test

c97e84c

Merge branch 'ai-dictation-mode' of https://github.com/C-Loftus/talon…

ce543f4

…-gpt into test

C-Loftus marked this pull request as draft March 28, 2024 18:11

a few minor prompt edits

8a2c847

C-Loftus closed this Jul 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ai dictation mode #48

Ai dictation mode #48

C-Loftus commented Mar 28, 2024

C-Loftus commented Mar 28, 2024

C-Loftus commented Mar 28, 2024

jaresty commented Mar 28, 2024 •

edited

Loading

4b11b4 commented Apr 28, 2024

C-Loftus commented Apr 29, 2024

C-Loftus commented Jul 19, 2024

Ai dictation mode #48

Ai dictation mode #48

Conversation

C-Loftus commented Mar 28, 2024

C-Loftus commented Mar 28, 2024

C-Loftus commented Mar 28, 2024

jaresty commented Mar 28, 2024 • edited Loading

4b11b4 commented Apr 28, 2024

C-Loftus commented Apr 29, 2024

C-Loftus commented Jul 19, 2024

jaresty commented Mar 28, 2024 •

edited

Loading