MultiMarkdown: Discussion

multimarkdown generates invalid XML from smart quotes and dashes

2015-10-11T18:51:55Z

What output are you getting? It works fine for me, and the HTML validates at https://validator.w3.org/ when used in a complete HTML document.

(There's no such thing as valid or invalid XML/HTML without being a complete document. By itself, the text you sent is not a complete document.)

F-

Fletcher T. Penney
fletcher@fletcherpenney.net

multimarkdown generates invalid XML from smart quotes and dashes

2015-10-11T20:47:17Z

Hi Fletcher,

Thanks for getting back to me! I see that I created an imcomplete bug
report. The problem occurs when I use the "mmd2odf" batch file to convert
the four-line file I sent you into an .fodt file. LibreOffice refuses to
open the resulting file (attached, with a different name of 'test.fodt').

Thanks for pointing me to the XML validator. Renaming test.fodt to test.xml
and attempting to validate it as XML generates this message from the
validator:

Missing "charset" attribute for "text/xml" document.

And on line 51, there is a character that isn't in the us-ascii character
set.

So... is this a problem with the header information that Scrivener is
generating - or does MMD do this? Or is Scrivener outputting characters in
the body of the text that it should convert to something else? Or is MMD
not noticing the presence of input characters that are not in the assumed
character set, and thereby generating an invalid document?

Thanks for your help with this - much appreciated!

Best regards,
Rich

Rich Mauritz-Miller

multimarkdown generates invalid XML from smart quotes and dashes

2015-10-11T22:41:47Z

The problem is apparently the file's encoding. Save your text files as UTF-8, and that should allow proper output when processed by MMD.

When I converted the file to UTF-8 on my mac, it then works just fine.

F-

Fletcher T. Penney
fletcher@fletcherpenney.net

multimarkdown generates invalid XML from smart quotes and dashes

2015-10-11T23:11:37Z

Thanks, Fletcher - I appreciate your help.

I noticed that MultiMarkdown is written in Perl. I also noticed that MMD
doesn't have a lot of bugs in the bug database. Coincidence? :-)

A quick/naive question: how do I convert my files to UTF-8 format, as you
just did?

I will forward this information to the Scrivener folks.

Finally, I'm sending you a LinkedIn invitation. It seems we're both
interesting in both Comp. Sci. and healthcare.

Best regards,
Rich

multimarkdown generates invalid XML from smart quotes and dashes

2015-10-11T23:23:52Z

The old MMD was in Perl -- Markdown was in Perl, and MMD started as a fork of Markdown.

MMD v3 and v4 are both in C.

There aren't a lot of active bugs because the project is old (original MMD was 11 years ago or so), and I've built up some decent test suites over the years.

But mostly because a large number of users help me find bugs and fix them pretty quickly, so they don't sit around for too long.

Any good general text editor should be able to convert encoding. The problem is getting the encoding read properly when opening the file.

On my Mac, TextWrangler could not interpret the "special" characters. MultiMarkdown Composer opens the file and displays as "chinese" characters. Sublime Text opened it properly, and then easily saved as UTF-8. I know Sublime Text has a single license that is good on Mac, Linux, and Windows. It's a solid app and way more powerful than I truly take advantage of. But it works well.

Of course, the best approach is to save using the proper encoding to begin with…. ;) Usually in the "Save As" dialog there are options to specify encoding in any decent text editor. Not sure what Scrivener does. Don't remember off-hand, but I think Notepad can do it.

F-

Fletcher T. Penney
fletcher@fletcherpenney.net