I've done something similar at work, primarily to generate random data that'll pass our validation checks (real cities, valid SSN, etc).<p>It seems strange to me that the email address and usernames in the samples are based on a person's name, but they're different from the user's actual name.<p>I chose to just take the name, and transform it for the email address. I think some consistency will make it easier to test that the right user data shows up in the right places while testing.
I've used Faker and like it. For mocking big datasets I prefer <a href="http://www.mockaroo.com/" rel="nofollow">http://www.mockaroo.com/</a> served by <a href="https://github.com/typicode/json-server" rel="nofollow">https://github.com/typicode/json-server</a>