Extraction is not in markdown #206

harinisri2001 · 2024-12-24T09:30:42Z

I tried to extract the contents of pdf. But it is extracting as plain text, not as markdown. Am I missing any parameter?

from markitdown import MarkItDown
md = MarkItDown()

result = md.convert("microsoft_report.pdf")
print(result.text_content)

output_file = "output.md"
with open(output_file, "w", encoding="utf-8") as file:
file.write(result.text_content)
print(f"Markdown content has been written to {output_file}")

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extraction is not in markdown #206

Extraction is not in markdown #206

harinisri2001 commented Dec 24, 2024 •

edited

Loading

Extraction is not in markdown #206

Extraction is not in markdown #206

Comments

harinisri2001 commented Dec 24, 2024 • edited Loading

harinisri2001 commented Dec 24, 2024 •

edited

Loading