Keep raw data only #12

kaiix · 2023-05-16T00:09:38Z

For blogs, it is sufficient to keep only the raw data (blogs/rss/feed-*.xml, blogs/wp-content and the data transformation program, the derived data can be distributed as release assets (e.g. blogs-md.zip)

The text was updated successfully, but these errors were encountered:

hongqn · 2023-05-16T14:17:51Z

#19 提交了新的数据转换格式，咱们在这个 issue 里讨论出处理格式转换的具体解决方法吧。

hongqn · 2023-05-16T14:25:25Z

经过之前在 #15 中的讨论，我现在支持 @kaiix 的想法。

大致描述如下：

仓库中保留原始数据
创建 scripts 目录，放入各种类型转换脚本，并通过 README.md 说明用法
设置 GitHub Actions ，在 main 分支合并时自动构建 release ，转换出的各种格式用 release assets 的方法提供打包下载。

未来有新格式需求的 pull request ，应当提交转换脚本和 release workflow 的修改即可。

需要细化讨论一下的是，release 是每次合并都进行，还是手工触发。

yzqzss · 2023-05-16T14:47:16Z

赞同！

不过 blogs 目录下的情况比较特殊，此前几个 PR 做了 html 标准化、文件重命名、链接本地化等人工操作。
如果未来需要精校 Markdown 、修格式、修坏链的话，还是要人工编辑的。所以 blogs 下的东西恐怕不能用脚本从源 RSS 一路转成最终档。

kaiix assigned hongqn and yzqzss May 16, 2023

kaiix mentioned this issue May 16, 2023

Add github pages #15

Merged

hongqn mentioned this issue May 16, 2023

feat(tweets): Convert to SQLite database #19

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keep raw data only #12

Keep raw data only #12

kaiix commented May 16, 2023

hongqn commented May 16, 2023

hongqn commented May 16, 2023

yzqzss commented May 16, 2023 •

edited

Loading

Keep raw data only #12

Keep raw data only #12

Comments

kaiix commented May 16, 2023

hongqn commented May 16, 2023

hongqn commented May 16, 2023

yzqzss commented May 16, 2023 • edited Loading

yzqzss commented May 16, 2023 •

edited

Loading