Bookmarks tagged llm and research

Extracting Prompts by Inverting LLM Outputs

7 Aug 2024 arxiv.org (Archive)

We consider the problem of language model inversion: given outputs of a language model, we seek to extract the prompt that generated these outputs. We develop a new black-box method, output2prompt, that learns to extract prompts without access to the model's logits and without adversarial or jailbreaking queries.

#ai + #gpt + #llm - #research -

Previous Page 1 of 1 Next