Which of the following scenarios would result in a more reliable scientific claim?

Testing a question multiple times and obtaining the same results.
Testing a question multiple times and obtaining different results.
Testing a question once, and having another scientist replicate your experiment.
Testing a question multiple times, and having another scientist replicate your experiment.