We were pull test the 24awg wire using push pull gage
This type of test has a variety of repeatability problems. Trying to utilize Gage R&R on physical testing is a nightmare. I will give you some tips.
At you measuring ultimate, catastrophic failure? If so, put your pen down and walk away. It will never repeat. You may have to use "surrogate" measures to verify the tester. If it chart records, use yield instead of ultimate failure. It is far more repeatable in materials.
One approach I used was to use trilene fishing line at various test strengths as a surrogate material. It provides - by design - specific variation that your tester should be able to prove it can tell the difference between. You might not have those kinds of samples with your own material. I was working in a small range of pull strengths, so I could easily find line in my range. But, if you can use it in your tester (and you will need to learn the "trilene knot" to keep it from slipping), then input historical variation for your PV, it may work fine.
If your wire is stranded, try using different number of strands to determine of the tester can see the difference.
I hurts my brain when people want to rubber stamp Gage R&R on physical testing, but that rubber stamping is chronic, especially in automotive.