### Is there an existing issue for this?

- [x] I have searched the existing issues

### Problem statement

`DataGenerator` currently implements methods returning a Spark `DataFrame`. We rely on the calling code to persist generated data. Having internal methods to persist generated data will allow us to build end-to-end data generation workflows.

### Proposed Solution

Add methods to persist generated data in files or tables.

### Additional Context

_No response_
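
To make the proposed solution more concrete, here is a minimal sketch of what a persistence helper could look like. The function name `save_generated_data` and its parameters (`path`, `table`, `fmt`, `mode`) are hypothetical and not part of the current `DataGenerator` API. The sketch only assumes that the generated data is a Spark `DataFrame` and uses the standard `DataFrameWriter` calls (`format`, `mode`, `save`, `saveAsTable`).

```python
from typing import Any, Optional


def save_generated_data(
    df: Any,
    *,
    path: Optional[str] = None,
    table: Optional[str] = None,
    fmt: str = "parquet",
    mode: str = "overwrite",
) -> None:
    """Hypothetical helper: persist a generated Spark DataFrame to files or a table."""
    # Require exactly one destination so the call is unambiguous.
    if (path is None) == (table is None):
        raise ValueError("Specify exactly one of 'path' or 'table'")

    # DataFrame.write returns a DataFrameWriter; format() and mode()
    # return the writer itself, so the calls can be chained.
    writer = df.write.format(fmt).mode(mode)

    if path is not None:
        # Write the data as files under the given path.
        writer.save(path)
    else:
        # Write the data as a table with the given name.
        writer.saveAsTable(table)
```

With something like this in place, a workflow could call `save_generated_data(df, table="my_table")` on the `DataFrame` returned by the generator. Whether this should be a standalone function or a method on `DataGenerator`, and which formats and write modes to support, is open for discussion.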