Generated blogbook of my Misc. Tech. WordPress blog using my VBA program WPBlogExportFileToBook

Last updated on 29 Feb. 2024

I decided to put up and publish this post now on 8 Aug 2023 while this work is in progress, as I wanted to ensure that whatever work I have done till now is available for interested readers. 

Quick-Info:

Public folder share having code and data: WPBlogExportFileToBook, https://drive.google.com/drive/folders/1YQ7HAEx-mkpY0GjRFAkicK5BjFDUcGfZ?usp=drive_link .

ravisiyer.wordpress.com blogbook (in HTML) with pages, posts & comments and contents links (like table of contents): ravisiyermisctech.wordpress.2024-02-24.000-BlogBookwCLs.html (1.3 MB) (in v6 folder), https://drive.google.com/file/d/10aRzozLYicANJJr4j98MKxyBmLMVWXsI/view?usp=drive_link (download file and view in browser locally to see rendered HTML).

--- end Quick-Info ---

29 Feb. 2024 Update

Version 6 (v6 folder, https://drive.google.com/drive/folders/1Hdy3BtFc6lnJ95GfoGp5QvydVxzTpv8A?usp=drive_link ) has latest version with some bug fixes described in info.txt.

29 Feb. 2024 End-Update

9 Aug. 2023 Update

Version 5 (v5 folder, https://drive.google.com/drive/folders/1BmNmYCjOK7FpqcW6hbtMVmcynnxRpZpi?usp=drive_link ) has the latest version which creates blogbook with pages, posts and comments (all required content) of the blog. It is preceded by version 4 (v4 folder, https://drive.google.com/drive/folders/1pc2rPydVi9hFxphwszqd_MeSbtrIpVYM?usp=drive_link ).

The key files and folders of interest are:

  1. info.txt in v4 folder:  https://drive.google.com/file/d/1NRZUWMIbnazkW60eHia_NHZiXW5rjiyk/view?usp=drive_link 
  2. [29-Feb-2024 Update: PreTag issue solution. end-Update] PreTagIssue folder: https://drive.google.com/drive/folders/1XOLxZd_H7vEdl4W-xFpBuO_z7MWI6QLM?usp=drive_link and Info.txt in it: https://drive.google.com/file/d/17oPBo8ICYZgTp1r5CKv9EgmcTl1lr_5w/view?usp=drive_link
  3. info.txt in v5 folder:  https://drive.google.com/file/d/1FIm5SwMl9L10ZdX3qjFnVfNqr9Tt1g9U/view?usp=drive_link 
  4. Blogbook generated by WPBlogExportFileToBook in v5 folder: ravisiyermisctech.wordpress.2023-08-09.000-BlogBook (1.9 MB),  https://drive.google.com/file/d/11aBEQqEj91oC9VwRMR1SrT8p1AbuUvxY/view?usp=drive_link (download file and view in browser locally to see rendered HTML)
  5. Log file for run that generated above blogbook file: ravisiyermisctech.wordpress.2023-08-09.000-BlogBookLog,  https://drive.google.com/file/d/1s7LphFPtmGY8Kee0f0Z9xc5phsl9lZt3/view?usp=drive_link
  6. Version 5 source code: v5-MainWPBlogExportFileToBook.gs.txt,  https://drive.google.com/file/d/1QEpPXs8wKAEDxYki0btm2xexAdHWrQ9a/view?usp=drive_link
  7. Blogbook (in HTML) with pages, posts & comments and contents links (like table of contents):  ravisiyermisctech.wordpress.2023-08-09.000-BlogBookwCLs-utf8.html (1003 KB) (in v5 folder), https://drive.google.com/file/d/1uCBbSkXqqUSqdNCZEA940-rqvcIZ4RYs/view?usp=drive_link
  8. Above blogbook saved as PDF: ravisiyermisctech.wordpress.2023-08-09.000-BlogBookwCLs-utf8.pdf (40.1 MB, 382 pages),  https://drive.google.com/file/d/1fxEhPkHPGlkceFEnxIeQIBUxQgLpZKBu/view?usp=drive_link

Note that input (XML export) file is in top-level folder share (above version folders).

9 Aug. 2023 Update end

The first step in the work involved studying WordPress export file format which is significantly different from Blogger blog backup/export file format. Data and notes about this study are shared in folder: WordPress-Export-Format-Study, https://drive.google.com/drive/folders/1QYkC0m30JOfd0_8JCyH31qljg5Z2KEUr?usp=drive_link . The Notes text file in it, https://drive.google.com/file/d/1nUG4Ilh-lfjmzGDM35qK4pdvbYG3-vHm/view?usp=drive_link , gives an overview of the study.

The next step was to copy BlogExportFileToBook VBA code, rename it as WPBlogExportFileToBook and then modify its code to handle WordPress export file instead of Blogger export file. For more info. about BlogExportFileToBook, see my post: BlogExportFileToBook: VBA code to produce HTML blog book from Blogger blog backup/export XML file,  https://ravisiyermisc.blogspot.com/2023/07/BlogExportFileToBook-vba-code.html .

Initially I tried to include all posts and pages (without comments to start with) of my WordPress blog: Ravi S. Iyer Misc. Tech., https://ravisiyer.wordpress.com/ . But the generated blogbook had some formatting issues for posts. As I already had a separate posts blog book of this blog using my Google Apps Script program - WordPressFeedToBook, I chose to simply skip posts and focus only on pages for this WPBlogExportFileToBook VBA program. Later, if time permits, I will explore including posts too and try to fix the formatting issue that I had observed earlier. For more about WordPressFeedToBook, please visit my post: WordPressFeedToBook Google Apps Script to Create WordPress Blog Book (posts and comments) from Feed: Description and Stable Version Info,  https://ravisiyermisc.blogspot.com/2023/07/wordpressfeedtobook-google-apps-script.html .

Initially I included only the pages and not comments in them, in the created blogbook. After that got done, I added the comments too which was not very difficult as the blog XML export file's item entry for a page contains comment data too (as comment elements). The current version, v3, seems to be stable.

The folder share is provided in Quick Info section above and also links to the blog (pages) book (contained in the folder). The key additional files of interest are (v3 is the folder and version for all the output files listed below):


Comments

Archive

Show more