prompt | response |
---|---|
What are the benefits of LoRA finetuning? | LoRA allows for efficient parameter updates with reduced memory usage. |
How does attention work in transformers? | Attention enables models to weigh the importance of each input token in a sequence. |
prompt
: A question, prompt, or user utterance.response
: The ideal model response to the prompt.prompt | chosen | rejected |
---|---|---|
How do I deploy a model locally? | You can use the Remyx CLI to launch a local endpoint with the deploy command. | You just run it. It should work fine. |
What’s the capital of France? | The capital of France is Paris. | I think it’s either Madrid or Rome. |
prompt
: A question, prompt, or user utterance.chosen
: A high-quality response.rejected
: A lower-quality or less preferred response.