8000 How to get text without html tag from RecognitionPredictor? · Issue #739 · datalab-to/marker · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
How to get text without html tag from RecognitionPredictor? #739
Open
@littlehorse417

Description

@littlehorse417

Version

surya-ocr==0.14.5

Problem

RecognitionPredictor return text with html tag, such as <b>Abstract</b>. How to get text without html tag from RecognitionPredictor directly?

Code

predictions = recognition_predictor(images=[image], det_predictor=detection_predictor) 
result = [(text_line.text, text_line.bbox) for text_line in predictions[0].text_lines]
print(result)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0