Insights: microsoft/markitdown
Overview
7 Pull requests merged by 4 people
-
Removed the holiday away message from README.md
#266 merged
Jan 6, 2025 -
Recognize json as plain text (if no other handlers are present).
#261 merged
Jan 4, 2025 -
If puremagic has no guesses, try again after ltrim.
#260 merged
Jan 4, 2025 -
Added a test for leading spaces.
#258 merged
Jan 3, 2025 -
Feature/ Add xls support
#169 merged
Jan 3, 2025 -
feat: outlook ".msg" file converter
#196 merged
Jan 3, 2025 -
fix(transcription): TRANSCRIPTION_CAPABLE should be initialized
#194 merged
Jan 3, 2025
8 Pull requests opened by 7 people
-
Add serve command to start CORS enabled Flask server
#235 opened
Dec 31, 2024 -
JsonConverter for Converting JSON Files into Structured Markdown Files
#251 opened
Jan 3, 2025 -
refactor: split _markitdown.py into modular components
#253 opened
Jan 3, 2025 -
Add Ollama integration for image descriptions
#257 opened
Jan 3, 2025 -
Feature: Support XLSM, XLSB & Replace excel engine with faster calamine
#259 opened
Jan 3, 2025 -
remove leading and trailing \n for HtmlConverter
#262 opened
Jan 6, 2025 -
added the ability to call Ollama client seamlessly for image description
#265 opened
Jan 6, 2025 -
Set exiftool path explicitly.
#267 opened
Jan 6, 2025
7 Issues closed by 6 people
-
trouble with writing out markdown file
#78 closed
Jan 5, 2025 -
Feature Request: Support for .msg files (for outlook)
#62 closed
Jan 5, 2025 -
[BUG] JSON not supported?
#34 closed
Jan 4, 2025 -
[bug] markitdown._markitdown.UnsupportedFormatException
#222 closed
Jan 4, 2025 -
Rust rewrite
#245 closed
Jan 3, 2025 -
DOCX not being converted in docker?
#243 closed
Jan 2, 2025 -
<code> section in html
#241 closed
Jan 1, 2025
16 Issues opened by 16 people
-
Add option to utilize LLMs to analyze and describe images within documents
#256 opened
Jan 3, 2025 -
Please add the option to use GPT models for OCR.
#255 opened
Jan 3, 2025 -
It is hoped that a conversion function for the text input box can be added.
#254 opened
Jan 3, 2025 -
Got error to convert a PDF
#252 opened
Jan 3, 2025 -
limi?
#250 opened
Jan 3, 2025 -
[Contribution] BASH Script Addon
#249 opened
Jan 3, 2025 -
When trying to convert a German pdf. I get this Error:
#248 opened
Jan 2, 2025 -
Images are not visible/useable after PPTX to md conversion
#246 opened
Jan 2, 2025 -
Docx - Add an Option to ignore Header and Footer
#244 opened
Jan 2, 2025 -
Do you have a library that converts markdown to office?
#242 opened
Jan 2, 2025 -
【bug】 UnicodeEncodeError error
#240 opened
Jan 1, 2025 -
Different OCR provider (e.g. Azure Document Intelligence)
#239 opened
Dec 31, 2024 -
`convert_hn` isn't being used properly
#238 opened
Dec 31, 2024 -
Does not work with persian documents
#237 opened
Dec 31, 2024 -
Error parsing table in word
#236 opened
Dec 31, 2024 -
where is easyocr used in this method?
#233 opened
Dec 30, 2024
12 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
UnicodeEncodeError: 'gbk' codec can't encode character '\u2009' in position 390: illegal multibyte sequence
#227 commented on
Dec 31, 2024 • 0 new comments -
optional dependencies
#103 commented on
Dec 31, 2024 • 0 new comments -
UnicodeEncodeError: 'gbk' codec can't encode character '\u2022' in position 1195: illegal multibyte sequence
#198 commented on
Jan 2, 2025 • 0 new comments -
math formula ocr
#17 commented on
Jan 5, 2025 • 0 new comments -
how to save image in the markdown
#162 commented on
Jan 6, 2025 • 0 new comments -
Invalid readme - it's not able to convert PDF to markdown
#166 commented on
Jan 6, 2025 • 0 new comments -
Extraction is not in markdown
#206 commented on
Jan 6, 2025 • 0 new comments -
Support for GitHub issue/prs to markdown
#5 commented on
Jan 3, 2025 • 0 new comments -
Updated prompt to extract text and format it in Markdown, including a…
#200 commented on
Jan 6, 2025 • 0 new comments -
[Draft] Add Web API for MarkItDown
#202 commented on
Jan 3, 2025 • 0 new comments -
Enhance _markitdown.py to support embedding images in markdown
#205 commented on
Jan 4, 2025 • 0 new comments -
Setting up docs
#211 commented on
Jan 4, 2025 • 0 new comments