All three of the suggestions you report are fine.
In addition, you could test your prompts on a smaller development set. Fewer than 100 sentences is fine, and even 50 sentences would still be enough for this project.
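If it helps, here is a minimal sketch of one way to draw such a subset so that every prompt is tested on the same sentences. The function name `sample_dev_set`, the seed value, and the placeholder list `dev_sentences` are assumptions for illustration and are not part of the project setup.

```python
import random


def sample_dev_set(sentences, k=50, seed=0):
    """Return a reproducible random subset of at most k sentences."""
    # A fixed seed means every prompt is evaluated on the same subset.
    rng = random.Random(seed)
    if len(sentences) <= k:
        return list(sentences)
    return rng.sample(sentences, k)


# Hypothetical stand-in for the real development set.
dev_sentences = [f"sentence {i}" for i in range(300)]
dev_subset = sample_dev_set(dev_sentences, k=50)
```

Keeping the seed fixed makes scores comparable across prompt variants, since differences then come from the prompt and not from the sample.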