Base
WebParserBase
Source code in Docs2KG/parser/web/base.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 |
|
__init__(url, output_dir=None, input_dir=None)
Initialize the WebParserBase class
Parameters:
Name | Type | Description | Default |
---|---|---|---|
url |
str
|
URL to download the HTML files |
required |
output_dir |
Path
|
Path to the output directory where the converted files will be saved |
None
|
input_dir |
Path
|
Path to the input directory where the html files will be downloaded |
None
|
Source code in Docs2KG/parser/web/base.py
13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 |
|
download_html_file()
Download the html file from the url and save it to the input directory
Source code in Docs2KG/parser/web/base.py
34 35 36 37 38 39 40 41 42 43 44 45 |
|