Excel Layout KG
ExcelLayoutKG
Layout Knowledge Graph
For each Excel file, each sheet becomes a node (the same role a page plays in a PDF). The images and tables are then connected to their sheet, and the summary and description are added to the sheet node.
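The structure described above can be pictured as a small tree. The following is a minimal sketch, not the code in excel_layout_kg.py: the function name `build_layout_kg`, the keys `node_type`, `node_properties` and `children`, and the shape of the input dictionaries are all assumptions made for illustration.

```python
def build_layout_kg(workbook_name, sheets):
    """Build a nested dict: workbook root -> sheet nodes -> image/table nodes.

    All key names here are hypothetical; they only illustrate the layout idea.
    """
    root = {
        "node_type": "excel",
        "node_properties": {"name": workbook_name},
        "children": [],
    }
    for sheet in sheets:
        # Each sheet is a node, and carries its summary and description.
        sheet_node = {
            "node_type": "sheet",
            "node_properties": {
                "sheet_name": sheet["name"],
                "summary": sheet.get("summary", ""),
                "description": sheet.get("description", ""),
            },
            "children": [],
        }
        # Images and tables hang off the sheet they belong to.
        for image_path in sheet.get("images", []):
            sheet_node["children"].append(
                {"node_type": "image", "node_properties": {"path": image_path}, "children": []}
            )
        for table_path in sheet.get("tables", []):
            sheet_node["children"].append(
                {"node_type": "table", "node_properties": {"path": table_path}, "children": []}
            )
        root["children"].append(sheet_node)
    return root


kg = build_layout_kg(
    "report.xlsx",
    [{"name": "Sheet1", "summary": "Quarterly totals", "images": ["sheet1.png"], "tables": ["sheet1.csv"]}],
)
```

A nested dictionary is used here because it serializes directly to JSON; the real class may organize its nodes differently.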
Source code in Docs2KG/kg/excel_layout_kg.py
__init__(folder_path, input_format='pdf_exported')
Initialize the class with the pdf file.
The goal is to construct the layout knowledge graph.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
folder_path | Path | The path to the pdf file | required |
Source code in Docs2KG/kg/excel_layout_kg.py
create_kg()
Create the layout knowledge graph
Source code in Docs2KG/kg/excel_layout_kg.py
document_kg()
Construct the layout knowledge graph skeleton first
We will require the md.json.csv file with the following columns:
- layout_json
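As a rough illustration of the input requirement above, the sketch below reads a CSV and parses its `layout_json` column. It assumes each cell holds a JSON-encoded string; that encoding, the helper name `read_layout_rows`, and the extra `sheet_name` column in the sample data are assumptions, not details confirmed by the source.

```python
import csv
import io
import json


def read_layout_rows(csv_file):
    """Return the parsed layout_json value of every row (hypothetical helper)."""
    reader = csv.DictReader(csv_file)
    # Fail early if the required column is missing.
    if "layout_json" not in (reader.fieldnames or []):
        raise ValueError("expected a 'layout_json' column")
    return [json.loads(row["layout_json"]) for row in reader]


# Build a tiny in-memory CSV so the example runs without any files on disk.
sample = io.StringIO()
writer = csv.DictWriter(sample, fieldnames=["sheet_name", "layout_json"])
writer.writeheader()
writer.writerow(
    {"sheet_name": "Sheet1", "layout_json": json.dumps({"tag": "table", "children": []})}
)
sample.seek(0)

layouts = read_layout_rows(sample)
```

Checking for the column before iterating gives a clear error when the file was produced by an earlier step that did not add `layout_json`.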
Source code in Docs2KG/kg/excel_layout_kg.py
export_kg()
Export the knowledge graph to a JSON file
Source code in Docs2KG/kg/excel_layout_kg.py
load_kg()
Load the knowledge graph from JSON
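Exporting to JSON and loading it back amount to a round trip through a file. The sketch below shows that round trip with the standard library only; the function names `export_kg_to_json` and `load_kg_from_json`, the `kg` subfolder, and the file name `layout_kg.json` are hypothetical and are not taken from excel_layout_kg.py.

```python
import json
import tempfile
from pathlib import Path


def export_kg_to_json(kg, kg_folder):
    """Write the graph dict to <kg_folder>/layout_kg.json (hypothetical name)."""
    kg_folder.mkdir(parents=True, exist_ok=True)
    out_path = kg_folder / "layout_kg.json"
    with open(out_path, "w", encoding="utf-8") as f:
        # ensure_ascii=False keeps non-ASCII sheet names readable in the file.
        json.dump(kg, f, ensure_ascii=False, indent=2)
    return out_path


def load_kg_from_json(path):
    """Read a graph dict back from a JSON file."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


with tempfile.TemporaryDirectory() as tmp:
    kg = {"node_type": "excel", "children": [{"node_type": "sheet", "children": []}]}
    path = export_kg_to_json(kg, Path(tmp) / "kg")
    restored = load_kg_from_json(path)
```

The round trip preserves the graph as long as it contains only JSON-compatible values such as dicts, lists, strings, and numbers.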
Source code in Docs2KG/kg/excel_layout_kg.py