You only really need to collect lots of real responses if you want to grade sentences. For simpler cases pattern-match is quite easy if you know how.
And, the example you give is almost exactly the same as the SI units example in the docs: http://www.open.edu/openlearnworks/mod/oucontent/view.php?id=52747§ion=2.2.1