7 Commits

Author SHA1 Message Date
XCX
e504e73409 Changed the file path of saving data 2023-08-11 12:22:48 +08:00
ldy
7726650eaa Bug fixed:
ignored blank-space elements in the middle name list
2023-08-10 13:40:26 +08:00
ldy
71e613d994 Optimization:
less memory usage
data collection for volume HTML format error
added time elapse monitor
2023-08-10 12:57:28 +08:00
ldy
2c25682f81 Bug Fix:
1. unworkable retrying function back online baby
New Function:
1. reformatted datetime_transform funtion to handle more month typos
2. reformatted process_article function into 3 functions to use multi-threads better running time
3. renewed article url search technique to handle different volume websites
4. more exception handling
5. bettered keywords and affiliation strip method
6. added methods for processing author data when there exists no author table
7. added code for retry failed processing paper
8. more detailed error messages storage
2023-08-10 01:15:17 +08:00
XCX
9ee9bc4462 Replace the code for merging data 2023-08-08 22:57:29 +08:00
ldy
49746b779b handled 2 typos in month while formatting date 2023-08-08 13:24:51 +08:00
XCX
1e98615778 A new code for same web data merge00_File_merge 2023-08-06 19:42:43 +08:00