How to get text without HTML tags from RecognitionPredictor?

Version

surya-ocr==0.14.5

Problem

RecognitionPredictor returns text that contains HTML tags, such as <b>Abstract</b>. Is there a way to get plain text, without HTML tags, directly from RecognitionPredictor?

Code

predictions = recognition_predictor(images=[image], det_predictor=detection_predictor) 
result = [(text_line.text, text_line.bbox) for text_line in predictions[0].text_lines]
print(result)
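Until a built-in option is confirmed, one possible workaround is to strip the tags after recognition. The sketch below uses only Python's standard-library `html.parser`. The helper name `strip_tags` is hypothetical and is not part of surya, and the sketch assumes the tags are simple inline markup such as `<b>` or `<i>` whose inner text should be kept as-is.

```python
from html.parser import HTMLParser


class _TextOnly(HTMLParser):
    """Collects text nodes and ignores every start/end tag."""

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; into "&"
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(text: str) -> str:
    """Hypothetical helper (not a surya API): return text with HTML tags removed."""
    parser = _TextOnly()
    parser.feed(text)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)


# Possible use with the snippet from this issue (assumes `predictions` exists):
# result = [(strip_tags(text_line.text), text_line.bbox)
#           for text_line in predictions[0].text_lines]
```

This keeps the bounding boxes untouched and only post-processes the `text` field, so it does not depend on how RecognitionPredictor produces the markup.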
