A powerful Logseq plugin that transcribes voice notes and automatically creates tasks in Todoist using OpenAI's Whisper and GPT models.
Logseq's mobile app records audio in AAC format by default. Since OpenAI's Whisper API doesn't support AAC files directly, you'll need to run the Audio Converter Server to process these files.
The logseq-audio-converter-server provides automatic AAC to M4A conversion for mobile recordings.
-
Install Prerequisites:
- Python 3.8+
- FFmpeg (Download)
-
Clone and Install:
git clone https://github.com/GonzRon/logseq-audio-converter-server cd logseq-audio-converter-server pip install -r requirements.txt
-
Run the Server:
python aac_converter_server.py
The server runs on
http://127.0.0.1:3456
by default. -
Configure in VoiceFlow Settings:
- AAC Converter Host:
127.0.0.1
- AAC Converter Port:
3456
- AAC Converter Host:
Note: Desktop users recording in MP3/M4A/WAV formats don't need the converter server.
- Voice Transcription: Convert audio files to text using OpenAI's Whisper API
- AI Summarization: Generate intelligent summaries and extract actionable tasks
- Direct Todoist Integration: Create tasks without external dependencies
- Smart Tag Detection: Automatically trigger actions using spoken hashtags
- Hierarchical Tasks: Create master tasks with subtasks for complex projects
- Due Date Recognition: Natural language processing for deadlines
- Mobile Support: Full AAC audio support with converter server
- Download the plugin from the Logseq marketplace (or clone from GitHub)
- Enable the plugin in Logseq settings
- Get your API key from OpenAI Platform
- Open plugin settings in Logseq (βοΈ β Plugins β VoiceFlow Automate)
- Enter your OpenAI API key
- Select your preferred model (GPT-4o recommended)
- Test the connection
- Get your API token from Todoist Settings
- Enter the token in plugin settings
- Test the connection
- Configure project mappings (optional)
- Follow the Audio Converter Server Setup above
- Configure converter settings in plugin
- Test with a sample AAC recording
-
Record Audio:
- Mobile: Use Logseq's built-in recorder (creates AAC files)
- Desktop: Use any audio recorder plugin (MP3/M4A/WAV)
-
Process Voice Note:
- Place cursor on the block with audio file
- Use slash command
/voiceflow
or keyboard shortcutCmd/Ctrl+Shift+V
-
Automatic Processing:
- Transcribes the audio
- Creates AI summary (if enabled)
- Extracts and creates tasks in Todoist (if #todo spoken)
- Generates a new Logseq page with everything
Use these spoken hashtags in your voice notes to trigger actions:
"hashtag todo"
or"hashtag task"
- Create task(s) in Todoist"hashtag to-do"
- Alternative spelling supported
"hashtag ai"
- Use AI summarization (overrides default)"hashtag direct"
or"hashtag literal"
- Use literal transcription
"hashtag [project-tag]"
- Route to specific Todoist project- Example:
"hashtag work"
routes to your Work project
"hashtag urgent"
or"hashtag high"
- Set priority 4"hashtag medium"
- Set priority 3"hashtag low"
- Set priority 1
"due date tomorrow"
- Sets due date to tomorrow"due date next Friday"
- Natural language date parsing"due date January 15th"
- Specific dates supported
"Remember to call the dentist tomorrow, hashtag todo"
Creates: Single task "call the dentist" with due date tomorrow
"Plan the office party: book venue, order catering, send invitations,
arrange decorations, hashtag todo hashtag ai hashtag work"
Creates:
- Master task: "Plan Office Party" (in Work project)
- Subtasks: Book venue, Order catering, Send invitations, Arrange decorations
"Finish the quarterly report, due date Friday, hashtag todo hashtag urgent"
Creates: High-priority task with Friday deadline
Map spoken hashtags to Todoist projects in the UI:
- Click "Load Projects for Mapping" button
- Edit the generated JSON to customize tags:
{
"#work": {"id": "2234567890", "name": "Work"},
"#personal": {"id": "2234567891", "name": "Personal"},
"#health": {"id": "2234567892", "name": "Health"}
}
Customize how AI processes your voice notes by configuring custom prompts in settings.
- Literal: Exact transcription β single task
- AI: Intelligent processing β multiple structured tasks
- Ensure Audio Converter Server is running
- Check converter host/port settings match server
- Verify FFmpeg is installed correctly
- Server logs: Check
aac_converter.log
- With Converter: AAC, MP3, M4A, WAV, WebM
- Without Converter: MP3, M4A, WAV, WebM (no AAC)
- Verify API token is correct
- Test connection in settings
- Check project mappings are valid
- Ensure you spoke a trigger tag (#todo)
- Check browser console for errors
- 401: Invalid API key
- 429: Rate limit or quota exceeded
- 400: Check audio file format
- Network Error: Check internet connection
- Connection Refused: Start the converter server
- Port Already in Use: Change port in both server and plugin settings
- FFmpeg Not Found: Install FFmpeg and check PATH
- Record voice note in Logseq mobile
- Ensure converter server is running on your computer
- Sync to desktop
- Process with VoiceFlow
- Record with audio plugin (MP3/M4A format)
- Process immediately with VoiceFlow
- No converter needed for non-AAC files
- Quick Capture: Record β Add
#todo
β Auto-process - Batch Processing: Record multiple items β Use AI mode for smart extraction
- Project Organization: Set up project mappings for automatic routing
- Meeting Notes: Record meeting β AI summarizes β Tasks created automatically
- Direct Todoist Integration: No external plugins required
- Enhanced Tag Detection: Better recognition of spoken hashtags
- Hierarchical Task Support: Create structured task lists
- Due Date Recognition: Natural language date parsing
- AAC Support: Full mobile audio support with converter server
- Improved AI Processing: Better task extraction with GPT-4
- Transcription Speed: 10-30 seconds for typical voice notes
- AAC Conversion: 1-3 seconds via converter server
- AI Processing: 2-5 seconds for summary and task extraction
- File Size Limits: 25MB for audio files (configurable in converter)
- Local Processing: AAC conversion happens on your machine
- API Security: Keys stored locally in Logseq settings
- No Data Storage: Converter server doesn't store files
- Temporary Files: Auto-cleaned after processing
MIT License - See LICENSE file for details
Contributions welcome! Please submit issues and pull requests on GitHub.
- logseq-audio-converter-server - AAC conversion service
- Logseq - Privacy-first knowledge base
- OpenAI for Whisper and GPT APIs
- Todoist for their excellent API
- FFmpeg for audio conversion capabilities
- Logseq community for feedback and support
For issues:
- Plugin Issues: Open issue on GitHub
- Converter Issues: Check converter repo
- API Issues: Check OpenAI/Todoist documentation
Enable Debug Mode in settings for detailed logging in browser console (Ctrl+Shift+I
).