Requirement
Extract the contents of a CHM document and save them as HTML.
Method 1: Use an online conversion tool or conversion software
Method 2: Use a script
Step 1: Decompile with a .bat script
This script decompiles the CHM file, unpacking it into its source files. Some documents yield TXT files directly and others yield HTML; which you get depends on how the CHM file was generated.
1. Create a new TXT document and write the command following the example below, then save the file with a .bat extension.
Format:
hh -decompile <output directory> <CHM file to convert (relative or absolute path)>
Example:
hh -decompile D:\Desktop\ 123.chm
2. Right-click the .bat file and choose "Run as administrator". The result is written to the output directory specified in the command.
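For readers who would rather drive Step 1 from Python than from a hand-written .bat file, here is a minimal sketch. It only assembles and runs the same `hh -decompile` command shown above; the function names `build_decompile_command` and `decompile_chm` are hypothetical, and it assumes `hh` is on the PATH of a Windows machine.

```python
import subprocess
import sys
from pathlib import Path


def build_decompile_command(output_dir, chm_file):
    """Return the argument list for: hh -decompile <output_dir> <chm_file>."""
    return ["hh", "-decompile", str(output_dir), str(chm_file)]


def decompile_chm(output_dir, chm_file):
    """Run the decompile command. Assumes Windows, where hh.exe is available."""
    if sys.platform != "win32":
        raise RuntimeError("hh.exe is only available on Windows")
    # Create the output directory first so the extracted files have a target.
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    completed = subprocess.run(build_decompile_command(output_dir, chm_file))
    return completed.returncode


# Same paths as the example above (illustrative only):
# decompile_chm("D:\\Desktop\\", "123.chm")
```

Passing the arguments as a list avoids quoting problems when the output directory or file name contains spaces.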
Step 2: Convert HTML to TXT
Reference: [HTML] Python extracts HTML text to TXT
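As a rough illustration of this step, the sketch below extracts the visible text from decompiled HTML files using only the Python standard library. It is not the method from the referenced post. The names `TextExtractor`, `html_to_text` and `convert_directory` are hypothetical, and the default `utf-8` encoding is an assumption: CHM content is often stored in a legacy encoding, so pass the correct one for your files.

```python
from html.parser import HTMLParser
from pathlib import Path


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return "\n".join(self._chunks)


def html_to_text(html):
    """Return the visible text of an HTML string, one text chunk per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


def convert_directory(html_dir, encoding="utf-8"):
    """Write a .txt file next to every .htm/.html file under html_dir."""
    written = []
    for path in sorted(Path(html_dir).rglob("*")):
        if path.suffix.lower() in {".htm", ".html"}:
            html = path.read_text(encoding=encoding, errors="replace")
            target = path.with_suffix(".txt")
            target.write_text(html_to_text(html), encoding="utf-8")
            written.append(target)
    return written
```

Calling `convert_directory` on the output directory from Step 1 would leave a TXT file beside each extracted HTML page.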