272 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Guilhem Fauré | a0fd7f0efa | output fixes | 2023-05-17 17:29:24 +02:00 |
| Guilhem Fauré | 6740035958 | iterate over sections, creating _index.md files. commenting. more context with unknown characters | 2023-05-17 17:25:19 +02:00 |
| Guilhem Fauré | d15ad5fd8e | start big refactoring to iterate over sections, then section’s articles | 2023-05-17 12:18:01 +02:00 |
| Guilhem Fauré | 8021bd395e | refactor | 2023-05-17 12:04:04 +02:00 |
| Guilhem Fauré | df0b623383 | more typing, fixes | 2023-05-17 11:43:48 +02:00 |
| Guilhem Fauré | 221aa5d3aa | fixes to use the program with repr | 2023-05-17 11:05:54 +02:00 |
| Guilhem Fauré | c38274b58e | init section class | 2023-05-17 10:53:30 +02:00 |
| Guilhem Fauré | 834b20c9fe | title after caption & caption in yaml | 2023-05-17 10:24:36 +02:00 |
| Guilhem Fauré | d7ab785b17 | spip headings as h2 and article title as h1 | 2023-05-17 10:21:49 +02:00 |
| Guilhem Fauré | cd8febda2d | more data in articles, including internal spip IDs & translation key | 2023-05-17 10:11:03 +02:00 |
| Guilhem Fauré | 6f11783f7c | flag empty articles | 2023-05-16 17:15:43 +02:00 |
| Guilhem Fauré | f5c0f8cf51 | multiple files for multiple languages | 2023-05-16 16:48:36 +02:00 |
| Guilhem Fauré | 380f69da96 | fixes for pypy | 2023-05-16 16:23:42 +02:00 |
| Guilhem Fauré | 06701397da | Unnamed update | 2023-05-16 16:16:29 +02:00 |
| Guilhem Fauré | 619376003f | fix encoding replacing | 2023-05-16 16:08:03 +02:00 |
| Guilhem Fauré | e1c8bd4b2e | PEP8 refactor | 2023-05-16 15:55:51 +02:00 |
| Guilhem Fauré | aa1b822688 | more typing | 2023-05-16 14:01:35 +02:00 |
| Guilhem Fauré | f23073ef12 | sections directories, better unknown char messages | 2023-05-16 13:29:59 +02:00 |
| Guilhem Fauré | caf9db541f | support for config file | 2023-05-16 11:37:34 +02:00 |
| Guilhem Fauré | 869714f926 | refactor | 2023-05-16 11:31:44 +02:00 |
| Guilhem Fauré | 1076040316 | use str.replace() instead of regex when not needed | 2023-05-16 11:29:22 +02:00 |
| Guilhem Fauré | b61853a4d5 | unknown characters highlighting & reporting | 2023-05-16 11:19:47 +02:00 |
| Guilhem Fauré | 12db0375e7 | better article text build | 2023-05-16 10:01:33 +02:00 |
| Guilhem Fauré | bc616cc7a1 | started allowing to gather unknown encoding bugs | 2023-05-15 17:18:36 +02:00 |
| Guilhem Fauré | b8f99fb329 | refactor project structure | 2023-05-15 17:10:58 +02:00 |
| Guilhem Fauré | 8eb0d1101a | add conversion ç | 2023-05-11 16:20:24 +02:00 |
| Guilhem Fauré | 629594de9b | remove every html tag (maybe temporarily) | 2023-05-11 16:11:20 +02:00 |
| Guilhem Fauré | b3119924a8 | more strict cleaning of metadata | 2023-05-11 15:17:44 +02:00 |
| Guilhem Fauré | d8b7a1b562 | style fixes | 2023-05-11 14:31:08 +02:00 |
| Guilhem Fauré | 65e9f0a67b | more encoding fixes, warns when unknown encoding | 2023-05-11 14:22:13 +02:00 |
| Guilhem Fauré | 3e3259c564 | delete lark syntax | 2023-05-11 13:46:34 +02:00 |
| Guilhem Fauré | ca4a3c1a96 | lowercase meta class (pyright) + do not print title in markdown & h1 headings | 2023-05-11 13:45:33 +02:00 |
| Guilhem Fauré | 3a261800a6 | update licence | 2023-05-11 11:47:29 +02:00 |
| Guilhem Fauré | 4141c10bfc | add explanation in comments | 2023-05-11 11:38:38 +02:00 |
| Guilhem Fauré | 995fee5b6a | fixed most of the encoding bugs | 2023-05-11 11:36:23 +02:00 |
| Guilhem Fauré | b3fa5023c4 | fix some encoding bugs with regex replace | 2023-05-11 10:33:35 +02:00 |
| Guilhem Fauré | 5c78dcd753 | init buggy encoding example | 2023-05-11 10:23:01 +02:00 |
| Guilhem Fauré | 3b36aeb776 | simplified spip->md mapping | 2023-05-11 10:22:50 +02:00 |
| Guilhem Fauré | 723a7ddeea | simplified architecture | 2023-05-11 09:50:18 +02:00 |
| Guilhem Fauré | 5e86ed0ed5 | export empty articles | 2023-05-11 09:25:26 +02:00 |
| Guilhem Fauré | a4bb234b72 | added pymysql again in requirements | 2023-05-10 11:17:06 +02:00 |
| Guilhem Fauré | 1541cffa10 | try to encode | 2023-05-10 11:13:43 +02:00 |
| Guilhem Fauré | e4a0eb68af | better cli | 2023-05-10 11:03:13 +02:00 |
| Guilhem Fauré | cf2345e43e | regex replacing spip to markdown conversion | 2023-05-10 11:00:27 +02:00 |
| Guilhem Fauré | 8a6026d129 | try with basic regex replacing | 2023-05-09 17:38:18 +02:00 |
| Guilhem Fauré | a455c8e4a2 | add pyparsing, lark not adapted to complex languages like SPIP or Markdown | 2023-05-09 16:47:02 +02:00 |
| Guilhem Fauré | 8eec4033f8 | paragraphs cannot start with tags | 2023-05-09 16:31:34 +02:00 |
| Guilhem Fauré | 8f9775119c | multiline headings | 2023-05-09 15:52:18 +02:00 |
| Guilhem Fauré | 8f4fcccbdc | accept " in tag options | 2023-05-09 14:57:55 +02:00 |
| Guilhem Fauré | c5c04cc645 | more precise tags, added problematic tag to tests | 2023-05-09 14:51:40 +02:00 |