Bookmarks tagged ai and llm

9 Jun ml-site.cdn-apple.com (Archive)
Apple ML Research paper

Another related blog post: https://garymarcus.substack.com/p/a-knockout-blow-for-llms
We consider the problem of language model inversion: given outputs of a language model, we seek to extract the prompt that generated these outputs. We develop a new black-box method, output2prompt, that learns to extract prompts without access to the model's logits and without adversarial or jailbreaking queries.
A calculator that uses an LLM to solve equations
24 Oct 2023 www.lesswrong.com (Archive)
A mechanistic explanation of the Waluigi Effect and other bizarre "semiotic" phenomena which arise within large language models such as GPT-3/3.5/4 and their variants (ChatGPT, Sydney, etc). This article will be folklorish to some readers, and profoundly novel to others.