Bookmarks tagged ai
Apple ML Research paper
Another related blog post: https://garymarcus.substack.com/p/a-knockout-blow-for-llms
Another related blog post: https://garymarcus.substack.com/p/a-knockout-blow-for-llms
We consider the problem of language model inversion: given outputs of a language model, we seek to extract the prompt that generated these outputs. We develop a new black-box method, output2prompt, that learns to extract prompts without access to the model's logits and without adversarial or jailbreaking queries.
"Your brain does not process information, retrieve knowledge or store memories. In short: your brain is not a computer"
"Turn any glasses into hackable smart glasses with less than $25 of off-the-shelf components. Record your life, remember people you meet, identify objects, translate text, and more."
"1. Accurate Tone Color Cloning. OpenVoice can accurately clone the reference tone color and generate speech in multiple languages and accents.
2. Flexible Voice Style Control. OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation.
3. Zero-shot Cross-lingual Voice Cloning. Neither of the language of the generated speech nor the language of the reference speech needs to be presented in the massive-speaker multi-lingual training dataset."
2. Flexible Voice Style Control. OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation.
3. Zero-shot Cross-lingual Voice Cloning. Neither of the language of the generated speech nor the language of the reference speech needs to be presented in the massive-speaker multi-lingual training dataset."
A hackable AI home assistant platform using the Google Nest Mini (2nd gen) form factor, consisting of:
- a custom PCB designed to be a drop-in replacement to the original, using the ESP32-S3 for audio processing
- a server for handling the transcription, response generation and Text-to-Speech from multiple devices on the same network
- a custom PCB designed to be a drop-in replacement to the original, using the ESP32-S3 for audio processing
- a server for handling the transcription, response generation and Text-to-Speech from multiple devices on the same network