**Is your feature request related to a problem? Please describe.**
Annotation support is needed for scene text detection and scene text recognition.

**Describe the solution you'd like**
Bounding boxes for marking text regions, plus a text-type label on each bounding box for entering the text it contains.
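
To make the request concrete, here is a rough sketch of what one exported annotation might look like. Everything in it is an assumption for illustration only: the `TextRegion` class, its field names, the pixel-coordinate convention, and the `export_regions` helper are hypothetical and not part of any existing tool or format.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class TextRegion:
    """One annotated text region (hypothetical schema)."""
    # Axis-aligned box in pixels; top-left origin is an assumed convention.
    x: float
    y: float
    width: float
    height: float
    # Transcription typed by the annotator for this box.
    text: str = ""


def export_regions(image_name, regions):
    """Serialize the regions of one image to a JSON string."""
    record = {
        "image": image_name,
        "regions": [asdict(region) for region in regions],
    }
    # ensure_ascii=False keeps non-Latin scene text readable in the output.
    return json.dumps(record, ensure_ascii=False)


regions = [
    TextRegion(34, 50, 120, 28, "OPEN"),
    TextRegion(40, 90, 200, 32, "24 hours"),
]
payload = export_regions("storefront.jpg", regions)
print(payload)
```

The boxes alone would cover the detection task, while the per-box `text` field would cover recognition, so one annotation pass could serve both.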