Friday, December 1, 2006

Manual of Style (dashes)

em-dash and en-dash
Question on Style. I recieved the following comment on a page I edited:
(&am; "#151;" is not a valid HTML entity... it should be & "mdash;" or & "#8212;").
I think & "#151;" is perfectly valid for a "printer's em or em dash" Anyone know why it is not? Also, should not the em be separated by spaces from the rest of the text, since it is NOT an ordinary dash, but a device for redirecting rthouyght within a sentence? Anybody know about this? Nextel ringtones Marshman/Marshman 04:47, 2 Aug 2003

:In ISO 8859-1 and Abbey Diaz Unicode, code point 151 is reserved as a control character. It is not an em dash except in Free ringtones Microsoft's proprietary code page extensions, and any program that displays an em-dash for "—" is doing so either erroneously or in deliberate emulation of common bugs in Windows. Relying on buggy behavior is not recommended. :) Please use the standard, either — or —. Majo Mills Brion VIBBER/Brion 05:02, 2 Aug 2003
:Thanks I will use — in the future. Coffee-Cup Software HTML Editor inserts "—" for an em-dash and it certainly displays that way on browsers. Why the confusion? Mosquito ringtone 24.94.86.252/24.94.86.252 05:36, 2 Aug 2003 me not logged inn Sabrina Martins Marshman/Marshman 05:38, 2 Aug 2003

:: please don't! — looks very ugly in wikisource and some editors may not know what it is. Stick to ". I know it's ugly, but in future our parser may turn that into mdash automagically. Nextel ringtones Tarquin/Tarquin 12:17, 2 Aug 2003

::: Does hardly look more ugly than ''L&uoml;beck.''


::: " gets really ugly when broken between lines, " - " would be a better advice.
::: Abbey Diaz Ruhrjung/Ruhrjung 12:42, 2 Aug 2003

:::: — is the only form that renders correctly on the largest possible number of browsers. — is almost as good. No other form renders correctly, except by mistake. Always use — or —. Free ringtones Tannin/Tannin 14:07, 15 Sep 2003


Dashes
(from the Majo Mills village pump)

Cingular Ringtones User:Wik seems to insist on replacing ndashes – with ASCII dashes -. Style guides for printed work such as encyclopedias, as well as Unicode, state that for ranges such as dates an ndash (1998–2000) and not a dash (1998-2000) should be used. One advantage of using the correct dash is that a linebreak won't occur on the right of it.
Is there some official policy from the Wikipedia on this, or should I just wait until Wik tires of his game and restore the correct dashes? critic harold Darkelf/Jogoblet of User talk:Darkelf/'''r''' 01:00, 12 Feb 2004

:Well, if you're prepared to insert the "correct dashes" into all the tens of thousands of articles which now have the ASCII dashes, go ahead. chinatown one Wik/Wik 01:04, Feb 12, 2004

:: Okay. I will interprete your quote above in that you'll start leaving them alone from now on. a luminous Darkelf/Jokeep adventurers User talk:Darkelf/'''r''' 01:05, 12 Feb 2004

:::No, only if you go through all articles and make it consistent. I will always edit the articles to fit the de facto standard. Currently, that's the ASCII dash. morse the Wik/Wik 01:07, Feb 12, 2004

::::Please use two ASCII hyphens in a future version of MediaWiki this will be automatically converted to –. The problem with using one hyphen is that they're very difficult to find and convert once the new feature is implemented. I'd be quite happy with people using – in the meantime. oakland pitcher Tim Starling/Tim Starling 01:11, Feb 12, 2004

Wikipedia articles are about being easy to read ''and'' edit. The average non techie reader has no idea what the sequence of characters "–" is supposed to mean. It makes the article source ugly and therefore harder to edit. This kind of stuff should be kept at a minimum.discovery linked Eloquence/—Eloquence 01:34, Feb 12, 2004

: I also don't like ndashes as they make editing harder. administration lawyers Dori/Dori / progeny surely User talk:Dori/Talk 03:14, Feb 12, 2004

:An ndash and and an mdash are NOT the same thing, and a '-' is not a substitute for an mdash;. I agree with Jo. Stop putting in ASCI dashes anywhere. Or else you are going to be real busy for the rest of your days because I use only ndash and mdash and will change any ASCI dashes I encounter to the correct form (something a BOT cannot do). And a should become an mdash not an ndash. The look of the "source code" is not an issue. Incorrect English prevails over making editing "easier" : Maybe we should just ignore spelling too - kennedy wasted Marshman/Marshman 05:42, 12 Feb 2004

:park hemmed Manual of Style (biographies) uses regular ascii dashes in dates (he mentally 1999 - disputes trade 2005). I don't see what the problem with them is personally. It makes editing easier and looks fine when rendered to my eyes. The manual of style isn't compulsory, but it's the only guideline that should be applied to wikipedia IMO. If it's under debate then hash it out on the talk page and modify the guidlines if necessary when a consensus has been reached. m cautiously Fabiform/fabiform / to sedlak User talk:Fabiform/talk 06:57, 12 Feb 2004

::Can anybody please explain why this matters at all? A dash is a dash is a minus sign... or not? And minus signs are far easier to use compared with some "&..." character sequence. Furthermore, it is my impression that everything else looks ugly in Mozilla-based browsers. The advandages of "&..." listed above look not too significant compared with the ease of editing that "-" offers. So, what are the reasons for using the "&..." things? Specifically, why are they considered "correct"? an overall Kosebamse/Kosebamse 11:19, 12 Feb 2004

::: See few flowers Dash (punctuation), and in particular, the external link at the bottom, http://www.alistapart.com/articles/emen/. While you are at it. look over http://www.alistapart.com/articles/typography/—a short essay on the theme "Typography, at the root, is all about providing as many helpful cues for the reader’s eye as possible." its alliance User:Tannin/Tannin

::::It can also be important when "viewing" pages through a different kind of browser, for example having a text-to-speech engine read it aloud. The different kinds of dash/hyphen can be used to cue different pauses or emphasis. HTH HAND nezavisimoye voyennoye Phil Boswell/Phil 12:17, Feb 12, 2004

::Fair enough. Still, everything except minus signs looks plain ugly on a monitor (at least under Mozilla et al.), i.e. there is not a helpful visual cue but a distraction, i.e. it is counter-productive to use the "n" and "m" things. Is there a solution to the display problem? Kosebamse/Kosebamse 12:44, 12 Feb 2004

::: Get your eyes adjusted, Kosebamse. No, I'm not making a smart crack here-if proper typography looks ugly to you, you have been spending too long reading badly set web pages, or student term papers, or some such. Take a break from the 'pedia and read some real printed-on-paper things (books, National Geographic, anything you like) till your eyes adjust themselves back to normality. As for Mozilla, it ''is'' ugly. Always has been. The most stable and practical browser around but ugly as a hatful of ar.... um ... bottoms. If you like pretty, use Opera. Or, if you must, Explorer. Microsoft have always been good at pretty. Tannin/Tannin 13:14, 12 Feb 2004

::: PS: I usually use Mozilla for most things, but nearly always Opera here (don't ask why, just habit). Looking at the page as rendered by Mozilla just now, it's fine. Perhaps your problem is the font support in Linux. Linus still has crappy on-screen fonts. Tannin/Tannin 13:21, 12 Feb 2004

:::: Oh my god, please stop spewing out misinformation. First of all, Mozilla has no problem with en or em dashes, minus signs, quotation marks, or most other relatively common characters in Unicode. If your font has the character, it will be displayed in Mozilla—just like with any other graphical browser. Second of all, this has ''absolutely nothing'' to do with Linux. Linux is an operating system kernel that controls your hardware and says which process gets to run when. It does not care in the slightest about em dashes. Finally, with that said, do check out the free, high-quality Bitstream Vera font family. —Daniel Brockman/Daniel Brockman 08:50, Mar 7, 2004

::Ah well... I like to see myself as a bibliophile and book maniac and could not agree more that good typesetting is A Good Thing. The "m" dashes are displayed too long, too high and without right or left spaces on Mozilla (under Linux). It would be A Very Good Thing to fix that but on which level? Browser? Style sheet? Font? Kosebamse/Kosebamse 13:28, 12 Feb 2004

:::Font, I suspect. Am m-dash should be exactly the same width as the letter "m" (uppercase or lowercase? I can't remember) in whatever font you are using. The "lack" of spaces is not an error. That's the way an m-dash is ''supposed'' to be rendered. Some-mostly American-publishing houses have taken to inserting spaces on either side of an m-dash in recent years. I have no idea why. A micro-space is acceptable if desired, but a full space ... well ... what is it they say? Two nations seperated by a common language? User:Tannin/Tannin

::Quite possible it's the fonts. I have played around a little and they all look either like a minus or as described above (much wider than a lowercase "m" and too high). Kosebamse/Kosebamse 13:50, 12 Feb 2004


= Do not forget we are a wiki! =
I must insist that we NOT use HTML entities in raw wiki markup. This is a barrier to editing to all the non-technically-minded people who do not know what — means when they see it in raw wiki text. Irrespective of what is correct typograpy, we must work with the tools at our disposal, and we must remember that this is a wiki and clarity in raw source is as important as clarity and accuracy in rendered form. Tarquin/Tarquin 16:42, 17 Mar 2004

:I am relatively new, and I disagree completely. I know a lot of other people do, too. Can we vote on this or something? - Omegatron/Omegatron 17:12, Mar 17, 2004

:Or perhaps a tech fix could solve the disagreement; replacing the 4 used dashes with special codes, for instance, as the horizontal bar has been replaced by ", and people were talking about replacing en dashes with " or whatever, which would be rendered as the correct HTML character. Probably something like "-en-" would be better and easier to grasp. This would be very easy for newbies to grasp, and would still format articles in a readable way. - Omegatron/Omegatron 17:15, Mar 17, 2004

::Getting the automatic conversion function back up would solve a lot of this tension. And "clarity in raw source"? Well, that disqualifies every single article with a summary box. Hajor/–''Hajor'' 17:23, 17 Mar 2004

:::And just to throw in my two cents on the "evolution of typography"; I agree that throwing out "correct typography" is akin to throwing out proper spelling. Sure, we could use "online style" for everything, and "embrace the new internet style", but by that logic, "WELCOMA 2 WIKIEPDIA111!!!!! OMG LOL W3 R BUILDNG A MULTILNGUAL COPYLEFT ENCYCLOPADIA TAHT WIL ALWAYS BLONG 2 3VERYONA!1!111 LOL" is a perfectly valid intro. It's an encyclopedia. It should look good. There is nothing wrong with conforming to "old" standards. We certainly shouldn't force people to use them, but there is nothing wrong with using them, either. - Omegatron/Omegatron 17:43, Mar 17, 2004


: Sevral points, Omegatron: yes, articles that begin with lengthy tables are a bad thing in many ways. There have been suggestions to move these to another namespace and insert them in pages. second, you need to understand how wikis work: theire open nature is crucial. Complex HTML terms are a barrier. Thirdly, I never mentioned "evolution of typography". What I SAID is that we have imperfect tools, that were not designed for typography, namely the basic ASCII set used on the internet. This is what we must work with, for now at least. Try to accept that WP is a work in progress! :) Tarquin/Tarquin 19:55, 18 Mar 2004

:: I have much more experience with WP than standard wikis, but if I am not mistaken, typical wikis don't have images, tables, TeX markup, boilerplate text, or normal text links (they have ugly CamelCase). All of these things are very good to have, make it much more encyclopedia-like, respectable, etc (if WP used camelcase I would probably never have come here. An article that looks crappy gives the impression of having crappy info). They should not be removed just because they make the markup a little more difficult to edit. Dashes are pretty trivial, but HTML entities in general should not be expressly prohibited just because they are confusing the first time someone sees them. Possible solutions to the disagreement are

::# Put a description in the first few pages that new editors see. Perhaps a "what are these &number; things?" or "what are these special symbols?" at the bottom of an edit page. (near Editing help) (I just checked, and editing help itself has a huge list of possible HTML characters, explaining plain as day what they are for)
::# Automated conversion - give some or all of the html entities less ugly formats that are automatically converted ( or -n- or -en- or or [special character: en dash] becomes – for rendering) Obviously you need to make it obvious to newcomers that the code is supposed to be there, without being hard to read, yet without making it so long that no one will type it. - User:Omegatron/Omegatron

:::It sure would be handy if there was a more capable browser based text editor provided to edit Wiki articles, with context driven online help. Or, maybe we could put together a very easy to use guide to markup sequences, in general, from the point of view of a beginner. - Bevo/Bevo 21:14, 18 Mar 2004

=Automated dash conversion and digital representation=
Well the new dash conversion has just gone live. One hyphen - ; two ; three - ; and of course four is the horozontal rule. We've all been muttering about having confusing "&..." symbols in the wiki editing box, but it just occured to me that this wont happen with the new markup. In the editing box the n-dash (if it was entered that way) will just look like , just as horozontal rules display as . Fabiform/fabiform / User talk:Fabiform/talk 12:38, 12 Feb 2004

:Yay! Finally we can have correct dashes! No more ugly poor man's dashes! (I'm with the mdash-ndash camp on this one.) Seav/seav 13:07, Feb 12, 2004

::Don't hooray too loud, as the same one breaks the new wiki table markup, where /- works, but / doesn't work anymore. But Tim already heard the complain on IRC... Ahoerstemeier/andy 13:10, 12 Feb 2004

::::I briefly switched this on using a live patch of two lines of PHP code, which was a bit silly because it broke a few things. I switched it off when I realised it broke links to titles containing , of which there are about 140. There was some contention on IRC as to whether should be expanded as an en dash or an em dash. Tim Starling/Tim Starling 23:24, Feb 12, 2004

:::::Two dashes should be an em dash of course. An en dash is represented in ASCII by a single dash, and as such cannot be automatically fixed but must be done manually. Darkelf/JoUser talk:Darkelf/'''r''' 22:01, 13 Feb 2004

:::::: Actually, it makes more sense that two hyphens make an en dash, and three hyphens make an em dash, for at least three reasons: (1) It enables usage of the en dash; (2) it uses only one token for the em dash (i.e., "-"), whereas two hyphens would encourage people to put spaces around the dash (thus: " "), which complicates parsing and takes away power from the style sheet; finally (3) this (using three for an em and two for an en) is how TeX has done it for decades.

:::::: I strongly advise that this long-established convention be adopted. The table syntax invented—what?—some momths ago?—can easily be adjusted to make this possible. I doubt that this would cause more confusion than throwing out the logical, intuitive, and well-tested TeX syntax. —Daniel Brockman/Daniel Brockman 08:50, Mar 7, 2004

:::Has this also got something to do with why the "nowiki" tags are showing at the top of this page: (3) Sign your name and date (by typing "nowiki"–~~ ~~"nowiki"? Fabiform/fabiform / User talk:Fabiform/talk 13:17, 12 Feb 2004 (not on irc).

::::Seems to be related - on Template:Villagepump it shows correctly, but once imported here it shows the nowiki's and there is a double - inside the nowikis. Ahoerstemeier/andy 13:24, 12 Feb 2004

Doesn't Wikipedia use UTF-8? Can't we just insert the actual mdash and ndash characters? That would make editing much easier. 137.222.10.57/137.222.10.57 17:03, 12 Feb 2004 .

:No, most English and Western European wikis use ISO 8859-1, for maximum browser compatibility. Tim Starling/Tim Starling 23:24, Feb 12, 2004

OK. Now I'm confused. According to the recent additions to Manual of Style, one should use a single-dash-without-spaces to represent a simple hyphen (as in date ranges), a single-dash-surrounded-by-spaces to represent an ndash, and two dashes (ie, ) for an mdash. However, if I follow the above conversation correctly, it seems that the software will convert a double-dash into an ndash and a triple-dash into an mdash. Am I misunderstanding, are there two incompatible standards being developed, or has something changed? -Rholton/Rholton (aka Anthropos) 23:51, 16 Feb 2004

:OK, you confirmed what I suspectedthat I thought it was resolved but there is in fact no clear statement to that effect. I'll try to rouse up a clarification of what's really going on. Elf/Elf 01:44, 17 Feb 2004

:When the automatic conversion was briefly turned on, a - remained unaffected, turned into a dash (an n dash I assume) and - turned into a longer dash (an m dash I assume). I have nothing to do with the programming though, so you might want to talk to someone else about this, especially if you would prefer it to be done another way. :) Fabiform/fabiform / User talk:Fabiform/talk 07:27, 17 Feb 2004

Not resolved. A few people involved in the field seem to find the use of the hypen by ordinary people for all of the purposes offensive. Those same ordinary people have no objecting to ignoring the attempt to prevent the language from evolving and ignore the variations in dash lengths now that normal people can easily write and publish and not follow the conventions which used to be used in the print world. Effectively, a small group is trying to enforce an undesired style rule on everyone else, when usage clearly indicates that the majority of contributors do not agree. Simply, the online style for almost everyone is to use - for everything. Since we're a wiki, we do have to accept that change in style expectations, because it's not practical for a few people to force everyone to do what they want. Just document the way most people do it - the simple hypehen - and document that it's accepted that those who are writing new text and object can do it the print way if they desire but are discouraged from changing the writing of others. Jamesday/Jamesday 02:46, 17 Feb 2004

: You are contradicting yourself. You start out by arguing that the pro-dash people are trying to “prevent the language from evolving,” and then go on to say that we should definitely use hyphens everywhere because “it’s the online style.” This begs the question: aren’t ''you'' preventing the ''online style'' from evolving/maturing?

: In my not-so-humble opinion, it would be an insult to humanity—and, specifically, the people who work with on-screen typography research, Unicode, the W3C, the Mozilla Project, and others who put in effort to enable to use of good typography on the World Wide Web—to throw out the typographical lore accumulated over the course of centuries just because some PHP script can’t do this or that, or because some people allegedly can read all text equally well no matter how badly formatted. As someone else noted, you’re not proposing that we abandon other seemingly “unnecessary” and “troublesome” English punctuation rules, such as italicising emphasized words or having a whole bunch of different punctuation marks—e.g., comma, semicolon, period, colon, dash, parentheses—that all basically mean “short pause”—or are you?

: Finally, yes, this is a wiki. This means that if ''you'' don’t know or care about the difference between the variously sized line segments sprinkled about the text, that’s okay, because ''I''—and probably a hundred other people who are willing to edit your text—do. I completely fail to see the logic in arguing that the collective competence of thousands of editors of a wiki could somehow be ''less'' than that of, say, one or a few of a paper. No doubt, an article is read a lot more often than it is edited, so it makes sense to spare all the future readers some eye-strain. irritation, and confusion, at the editor’s expense of a small one-time (or more likely few-times) typing cost.

: —Daniel Brockman/Daniel Brockman 08:50, Mar 7, 2004

:: Daniel, please don't use HTML entities, it makes your text extremely hard to read in raw form. Tarquin/Tarquin 10:08, 7 Mar 2004

:::So who's going to want to read it in '''raw''' form when Wikipedia so nicely formats it for us? Phil Boswell/Phil 10:10, Mar 9, 2004

:: Your point seems to be based on the mistaken belief that the use of multiple types of dash is moving forward. It's not, it's moving back to conventions based on the limitations of newsprint and paper reproduction. The web needs to support the old print conventions, to support republication of old documents and to support those who want to apply print conventions online, because that's how they have been trained and what they are used to. We don't need to use things just because they can be used. Sparing readers and editors eyestrain, irritation and confusion is why I'm going with the usual practice here, which is the hyphen for everything. I've no problems at all with not using rules learned over centuries when rules learned over the last twenty years show them to be inappropriate. One good example of such abandonment of inappropriate conventions is the usual choice of sans- rather than serife fonts online. Since it's unlikely that those used to print will convert to other conventions - they are more likely to believe that they must be right - the leave it alone approach seems best here, so those used to print don't feel that their own writing is wrong. Jamesday/Jamesday 03:05, 9 Mar 2004

::: So you're saying the em and en dash are obsolete? Proper typography causes eyestrain? You're joking, right? ''— Wikisux/Wikisux 00:20, 2 Jun 2004 ''

::: The choice of sans-serif fonts rather than serif fonts has everything to do with the relatively low resolution of display monitors versus the printed page. Serifs simply don't render well at typical sizes on a 100dpi monitor. I think it is safe to assume when monitor resolution comes more into line with printed resolution, say around 300dpi, we will see a resurgence of the use of serif fonts which are far more readable than sans serif. I cite personal experience before I learned proper typography and references such as "The PC is Not a Typewriter" as evidence. Jason Michael Smithson/Jason Michael Smithson 12:49, 2004 Sep 13

Disputed paragraph
:''Evolving language and the decreased reliance on print world conventions have led to the hyphen becoming an acceptable replacement for other dashes. Where hyphens have been used in place of other dashes, you are discouraged from changing these, in the same way that changing spelling forms is discouraged. (See #Usage and spelling).''

The statement that hyphens are now acceptable substitutes for other types of dashes had been added to the main page. In an attempt to avoid an edit war, I added a notation that this is disputed rather than removing it again. I just do not believe that this is the case. I don't know of any style guides or professional online publishers that have said that punctuation rules have changed. And I don't buy that, just because lots of people do it, that that makes it correct. (By that rule, "its" and "it's" would be interchangeable, for example.) And I certainly object to having the statement put into the style guide that we're not supposed to correct somebody else's punctuation when we come across it. I could live with the inclusion of the observation that some people feel that hyphens are acceptable to use for other dashes. And I expect tht in Wikipedia, people will type what they're comfortable with. And then other people will come through and clean it up. I get the impression from all of the various preceding discussions on dashes and hyphens that "hyphens-are-legitimate-for-anything" is a minority opinion, and "hyphen-once -typed-by-one-person-are-immutable" is very much a minority opinion. Elf/Elf 05:48, 8 Mar 2004

: I couldn't agree more. I think that we should aim for Wikipedia articles to look clean and professional. This goes hand-in-hand with NPOV. There is plenty of Internet left for those who wish to experiment with new, "online" punctuation rules. —Daniel Brockman/Daniel Brockman 21:21, Mar 8, 2004

::I've commented this out for now and just noted that the hyphen is commonly used in place of other dashes. I disagree that it should be "corrected", and I believe the safest option is to go with the same policy we have for spelling to prevent edit wars over this. I personally regard pages containing text such as "“”" to be highly unreadable. Angela/Angelauser talk:Angela/. 02:33, Mar 9, 2004

:If you think it's a minority opinion, I suggest that you try the exercise of listing those who have on this talk page expressed opposition to or support for the use of different dash types. My quick count placed those opposing it in the majority, with those used to the print world being at least a high proportion of those who favor it. Jamesday/Jamesday 03:10, 9 Mar 2004

:: Lots of people will find the use of hyphens instead of dashes annoying, as you say, because they are used to the print world. After all, there's a pretty good amount of people who read books. On the other hand, I suspect that very few, if not no one, will find the use of proper dashes annoying to read. Granted, some might find it annoying to ''type'', but that's been shown to be a non-problem, as other people will later come to clean it up.

:: As for being annoying to read ''in the edit box'', well, let's face it: the source ''is'' and ''will always be'' harder to read than the rendered page. I think that in this case, we're talking about a rather minor decrease in source readability (em dashes are relatively rare) in exchange for a rather major enhancement in the appearance of the rendered page: Hey, what's—what? ⇒ Hey, what's—what? vs.Hey, what's - what? ⇒ Hey, what's - what? Compare, for example, to Angela's example: He said, "what's up?" ⇒ He said, "what's up?" vs.He said, “what’s up?” ⇒ He said, “what’s up?”. Whether the apostrophes are a little bent makes a minor difference, since people will know they're apostrophes anyway. The length of dashes, on the other hand, makes a major difference, since a long dash carries a completely different meaning than a hyphen.

:: It might be possible to render " " or "-" as an em dash automatically, just as two apostrophes are rendered as emphasis markup. I'm all for this, since it would give us the very best of both worlds. (The new table syntax is very low-priority compared to this, IMHO.) In fact, I don't see how anyone could object to it. So what's the status on this? :-) Is it in the process of being implemented? —Daniel Brockman/Daniel Brockman 09:38, Mar 9, 2004

:::In parallel with the automated conversion (which I think is a good idea: I reckon " should go to N-dash and "-" to M-dash, just like ''' makes things '''bold''') maybe the toolbar could be extended to add in various types of dash. If people get used to clicking a button then you just need to alter the code behind the button. Phil Boswell/Phil 10:14, Mar 9, 2004

::::The problem with having three dashes (-) go to emdash instead of two dashes () is that the latter has been taught as an emdash in typing classes probably since the advent of typewriters, and publishers who buy manuscripts from writers have used this as the standard for emdashes (and still do). I realize that TeK, which is a markup language, uses - as the markup for an emdash, but that's a markup language, it's not the standard for typing. I'm going to reinsert the text that says use if you don't want to (or can't) use the others, because this is correct.

::::: Wiki markup is just as much of a markup language as TeX is. Yes, representing em dashes by is an existing convention. But this was the case a few decades ago, too, when Donald Knuth decided to go aginst the convention, supposedly because it was too ambiguous. Today, both and - are in widespread use. The former is still more common than the latter, but people who have used TeX are likely to stick to three hyphens, at least for text that is going to be parsed by a machine—as in the case of wiki markup—because they are aware of the ambiguities that would arise were both kinds of dashes represented by . I feel confident that Knuth made the right choice, and I believe we are now facing the exact same choice. —Daniel Brockman/Daniel Brockman 22:41, Mar 9, 2004

::::Furthermore, I don't really care whether people use dashes or the ampersand formats or whatever. What I do object to are (1) having info in the style guide that gives misinformation (hyphens just simply not the same as em dashes and just because people use single hyphens in their place doesn't make it correct; read some style guides) and (2) having info in the style guide that prohibits me from editing what other people have done. We made a real effort to specify what the different punctuations mean, to show the markup to use if you want to use it, and an alternative using regular dashes if that's what you want to use. I think that "-" no spaces for hyphen, " - " single with spaces for en dash, " double with or without spaces for em dash (I understand UK publishers sometimes prefer spaces) pretty much covers those options. Elf/Elf 20:05, 9 Mar 2004

::::: I appreciate your careful efforts here Elf, but the bottom line is that using to represent an em dash is doing just that: it ''represents'' an em dash, it does not pretend to be a replacement for it in anything bar a typed manuscript on its way to a publisher to be set properly. Bad typography is every bit as sloppy and unprofessional as bad spelling. I don't expect every Wiki contributor to get his or her punctuation right first time, but the Manual of Style certainly should not endorse bad punctuation as the standard. If you want to abandon correct language, please first demonstrate a consensus to do so. Tannin/Tannin 20:41, 9 Mar 2004

::::::I expected to be stressed about this but instead you've made my day. I really am laughingbecause I'm usually known as the harridan of correct typography and punctuation. It's so weird to be accused of being the opposite! Anyway, thanks for responding and I feel much better now. Maybe I'm repeating myself: I don't see that there is a disagreement that is never legit for em dashes, only whether to use markup of - for emdashes (which see above) or whether single hyphens are acceptable substitutes, or whether markup should be required (which I think is your view) or shouldn't ever be used. I'm trying to be realistic. Double dashes have been taught in typing classes for so very long (I don't instinctively type — when I'm in the throes of writing text, I type ) that it can't be eradicated. And people who don't want to muck with markup are going to type something in place of em dashes, and it just seems to make more sense to identify the existing punctuation convention than to annoy people who hate typing markup. That's how we got into this whole discussion, because not everyone likes typing markup. Elf/Elf 20:57, 9 Mar 2004


::::::: This is perfectly fine with me. What I object to is ''reccomending'' that people use a single hyphen in place of an em dash, and ''forbid'' other people from correcting it (yes, I view this as changing incorrect langage to correct language). I do not expect people to use the correct punctuation all the time, just as I do not expect people to write perfect prose all the time. What I ''do'' expect is (1) the right to improve on other people's subperfect language, and (2) that someone will eventually come along and improve on ''my'' subperfect language. —Daniel Brockman/Daniel Brockman 22:41, Mar 9, 2004

=Current status=
:OK, we've tried to add NPOV description of the various dashes that does the following:
*''Briefly'' describes the standard usage for the various dash types. (For detailed descriptions, there's a reference to Dash (punctuation).)
*Shows the special markup that's valid for each type of dash and also identifies how to represent each using the hyphen key on the keyboard. (Note that I don't have final info on whether someone implemented an automated tool that changes groups of hyphens to something else.)
*Gives a nod to the fact that there might be technical issues involved in using the special markup.
Elf/Elf 17:17, 20 Feb 2004

:I agree with the above. But I must strongly oppose the use of () for en dashes or (-) for em dashes. for en dash is absolutely wrong as it is far too long, and for em dash is wrong because in all computer fonts I know of they do not connect, and thus instead of providing the "long dash" which it should (as the practice does on old-style typewriters) it gives an ugly series of characters. Darkelf/— Jor User talk:Darkelf/(Darkelf) 22:18, 16 Mar 2004

What happened to the TeX-style autoexpansion of to – and - to —? This would be ''very'' helpful... —Tkinias/Tkinias 21:23, 5 Apr 2004

:Whatever happened to this? I thought it was a good idea. It just doesn't work with table markup? Sounds easy to fix. - Omegatron/Omegatron 19:09, Jan 21, 2005

::I completely agree. I'd love the TeX behavior. If articles consistantly used en-dashes in the usual way and un-spaced em-dashes as a gramatical dash, then a "dash style" could turn that semantic dash into whatever format people like, just like the date style. (Personally, I like the look of this – as a dash over—that, but semantic markup would allow us to all get along, even if we are using a cell phone and want - that as a dash.) — BenFrantzDale/BenFrantzDale 06:30, 28 Jan 2005

=Minus signs vs figure dashes=

Wow. This is an insane conversation. Anyway, I don't mean to start another feud, but I see in both the Dash (punctuation) article and in Manual of Style article (that goes with this page), figure dashes are equated with minus signs. The HTML entities for each are different, however, as I pointed out in the dash talk page.

Hyphen:
+-=-=
-+-=-

Minus sign:
+−=−=
−+−=−−−−−

Figure Dash:
+‒=‒=
‒+‒=‒‒‒‒‒

TeX:
1+2-3=

Hyphen:
1 + 2 - 3 =

Minus sign:
1 + 2 − 3 =

Figure dash:
1 + 2 ‒ 3 =


1 + 2 - 3 =
1+2-3=
1+2−3=
1+2‒3=

It looks as if the TeX markup HTML rendition uses a plain old hyphen for a minus sign. I think the − works better in equations, since that is what it is designed for; to be the same width and height as the plus and equal signs. Hyphens and figure dashes look obviously bad in comparison. The figure dash doesn't even have space around it in my font. (8‒8‒0‒0)

So:

# Should these be separated in the two articles?
# I have been using the − in my math articles. Is that ok?
# Should the TeX renderer use the minus sign too?

- Omegatron/Omegatron 21:56, Mar 16, 2004

:Figure dashes are not minus signs, that is a mistake. To answer your questions:
:# Yes. I'm doing so now.
:# Using − is very good even, when used in mathematical operations.
:# Probably. This is likely a font or encoding issue: the hyphen-minus is overused mainly because it is the only dash-like character known to exist on all platforms as it is in ASCII. If possible, Tex of course should use the real character.
:Darkelf/— Jor User talk:Darkelf/(Darkelf) 22:10, 16 Mar 2004

::2. Mostly, I was asking if it should be used, since it might not be supported by enough modern browsers. - User:Omegatron/Omegatron

:::It should be compatible with all modern browsers. I can't check archaic browsers like Netscape4, but any more recent browser should work. I know from personal experience the following support &minus:
:::*Opera (3.5 and up)
:::*Mozilla (any version), Netscape 6/7 and up, and other Geckos,
:::*Internet Explorer 4 and up.
:::*The Lynx and Links terminal browsers (ASCII approximations)
:::Darkelf/— Jor User talk:Darkelf/(Darkelf) 15:20, 17 Mar 2004



emdashes

em dashes — dashes the width of an "m" character — are used in typesetting and "upscale" web page set off text, in a use similar to parentheses. where em dashes are unavailable, two "regular" dashes are often used:

While em dashes look much nicer, on certain displays and devices such as my handheld they aren't in the character set at all, and are displayed using a placeholder character, often a question mark or unfilled square (a "box").

Should we use nice-looking em dashes, knowing they won't display properly on certain devices, or fall back on the less elegant but more portable double dash?

Thanks. Orthogonal/orthogonal 04:22, 16 May 2004

:Is it that time already? Yeah, ok, we've had the mirror/fork question and the case sensitivity question, I guess we must be back around to dashes. May I suggest we keep it civil this time? More like the time before last than last time. Tim Starling/Tim Starling 04:28, May 16, 2004


::Tim Starling: while it's certainly fine to note that this discussion has come up here before and I understand the annoyance of an old-timer seeing questions arise again and again I do it would be more helpful for you to let us know what the previous consensus was.

:::Sorry. There was a short discussion last August, Wikipedia talk:Manual of Style/Dashes#em-dash and en-dash/here. Then the civil discussion I referred to last December: Wikipedia talk:Special characters#Unicode. Then there was the discussion on wikitech-l in January: [http://mail.wikipedia.org/pipermail/wikitech-l/2004-January/007992.html]. And finally there was the February discussion Wikipedia talk:Manual of Style/Dashes#Dashes/here. The current situation as far as I'm concerned is that either TeX-style conversion or to — conversion will be implemented as soon as someone works out how to do it without breaking various things such as the table syntax. Tim Starling/Tim Starling 04:59, May 16, 2004

:::No problem, and thanks Tim, that's good to know. (however, this doesn't answer Othagonal's original question what should be done about the incompatibility of em dashes with certian devices? Personally, I think it's the device's own damn fault.) Adam Conover/ — User:Adam Conover / Adam Conover User talk:Adam Conover/† 16:47, May 16, 2004

::::In principle we could change the rendering according to user agent, although that may make caching difficult. Obviously we have to think of everyone, not just those trying to view Wikipedia from a mobile phone. Tim Starling/Tim Starling 01:23, May 17, 2004

::Personally, though I use an emdash in my sig, I generally use double-dashes when writing, simply because that's what I'm used to doing most online communities don't support anything else, and '' is the quickest way to write a dash in word and because they're must easier to type. I would guess that double-dashes will continue to be by far the most prevalent regardless of what we decide. Adam Conover/ — User:Adam Conover / Adam Conover User talk:Adam Conover/† 04:33, May 16, 2004

Yeah, I've been against them for some time now as they make editing difficult. Use " instead. RickK/Rick'''User talk:RickK/K''' 04:43, 16 May 2004

Changing '-' to "—"


I don't see why they should all be changed to "—". At large font sizes an "—" is *huge* and looks very ugly. A '-' works just as well and doesn't stand out as much.

Darrien/Darrien 17:19, 2004 May 23

This perceived problem is a perceived ''font'' problem. Argue to have the Wikipedia font changed if you feel its m-dashes are too long. In the meanwhile, if a dash (—) is meant, a dash should be used; if a hyphen (-) is meant, a hyphen should be used. Read any book from a major publisher — you won't find them using hyphens for dashes, either single, as you like them (-), or double (), as most wikipedians seem to. Bad punctuation is ugly to educated people Chameleon/Chameleon 17:35, 23 May 2004

:It is not a font problem at all. I'm surprised an "educated" person would expect a hyphen to have the same boldness as a dash. Perhaps you should concentrate on changing the all the 'U's and 'J's to 'V's and 'I's.
:Darrien/Darrien 17:43, 2004 May 23

::I think it'd be better, more wikilike, to have '' render as '—'. (I agree that a distinction between hyphens and em-dashes should be made.) If there's no burning need to use HTML entities for something, they shouldn't be used. I vote for ''. (Then again, we could go the TeX route and use '-' for the hyphen, '' for the e'''n'''-dash (used for ranges, like 1025), and '-' for the em-dash (used for saparation-like this-of text). Higher-quality typesetting, fits well into the syntax ('' is a horizontal rule, which is kinda sorta like the god-king of all dashes), and no more ugly HTML entities in the wikitext. Sounds like a win to me. Grendelkhan/grendelkhan/User_talk:Grendelkhan/(blather) 18:45, 2004 May 23
::: Yes, I agree that there should be automatic conversion. That way, people could just type " or "-" and correct dashes would appear. That would be great. However, until this is implemented, we will have to use HTML, in the same way that we would have to use if we didn't have '' '' — Chameleon/Chameleon 19:12, 23 May 2004

:::Automated conversion is a good idea. But & m d a s h ; (rendered as —) isn't supported in all browsers, as far as I know. And " seems to me a usenet style for m-dashes, not for n-dashes. (A related probleme is, btw, the difference between "some text ? some extra text ? some text" and "some text?some extra text?some more text" in different languages.) Tillwe/''till we'' ☼☽ / User_talk:tillwe/Talk 18:54, 23 May 2004

::::& mdash; displays correctly in all modern browsers these days. Let's work for forward, nor backward compatibility. Also bear in mind that "-", " or "-" for dashes do not display correctly (—) in ''any'' browser. — Chameleon/Chameleon 19:12, 23 May 2004

:::::We ''did'' have automatic conversion up and running for a couple of months ago: really cool, except that it broke the wikitable formatting commands and was deactivated in a matter of minutes. It'd be nice to have it back, but I'm afraid I'm not a programmer. Re the em-dashes and how they can look too long (a lot longer than a capital M here in Tahoma): the alternative is to use en-dashes (spaced), which is sanctioned by a fair number of schools of typography and punctuationalists. Hajor/–''Hajor'' 01:14, 24 May 2004

::::::Ah, that's a pity. Surely it can be programmed not to break wikitables. I hope the programmers are reading this. I've just been looking at a few fonts. It seems a lot have over-long m-dashes. Verdana, Arial Narrow, Book Antiqua, Comic Sans MS, Lucida Sans, High Tower Text, Trebuchet MS and others are okay though. Having said that, although dashes should not theoretically be any longer than an em, that fact that professional font designers make them a little longer shows that they agree with me in saying that it is more elegant for them to be too long than too short. In any case, I think it's clear that those who believe it's okay to put just a hyphen are in a typographically-challenged minority. Darrien, please put the pages back how you found them, and stop turning my dashes into hyphens. — Chameleon/Chameleon 09:44, 24 May 2004

:::::I like the "blank en-dash blank" proposal (written " " or " – ", possibly automatically transformed). Could we make a Wikipedia policy that this form should be used? – Tillwe/''till we'' ☼☽ / User_talk:tillwe/Talk 16:16, 24 May 2004

:::::I prefer hyphens to any form of dash, but I'd be far less opposed to the spaced endashes idea than to emdashes. Angela/Angeluser talk:Angela/a. 17:09, May 24, 2004

::::::I would accept spaced endashes. That would seem to be the only compromise that can be reached. emdashes look horrible at large font sizes. endashes actually look slightly better than hyphens at large font sizes.
::::::Darrien/Darrien 17:32, 2004 May 24

:::::::Spaced en-dashes are a semi-acceptable approximation to the true em-dash. The most important thing is that dashes must never be turned into hyphens, whether single or double. Note that Darrien is ''still'' doing this.
:::::::I know points should be argued, not proved by action... which is the only thing stopping me adding ''lol'' to every page and defending it on the basis that it is the internet standard for funny things. — Chameleon/Chameleon 03:41, 25 May 2004

::::::::Does this mean that you will accept the hyphens being changed to endashes?

::::::::Darrien/Darrien 04:14, 2004 May 25

::::I'd wholeheartedly support making spaced endashes policy. How would we go about that? Hajor/–''Hajor'' 04:18, 25 May 2004

:::::Add it to the page. See if you get reverted. If you don't, it's policy. ;) Angela/Angelauser talk:Angela/. 11:38, May 26, 2004

:::::Hang on — this en-for-em hack only works if en-dashes are significantly longer than hyphens in this font, and they are not. We need to keep using correct em-dashes, and perhaps consider changing the font. This is what I see around
:::::* ''some text-more text''
:::::* ''some text - more text''
:::::* ''some text -more text''
:::::* ''some text- more text''
:::::* ''some textmore text''
:::::* ''some text more text''
:::::* ''some text more text''
:::::* ''some text more text''

::::: I'll carry on correcting all of these to:
:::::* ''some text — more text''
::::: ...though I'll leave en-dashes if I find them. Chameleon/Chameleon 09:12, 26 May 2004

::::::Please stop "correcting" them. There is clearly no consensus that these are correct, and certainly no agreement that they should be used. Anyway, aren't spaced emdashes are always wrong? Angela/Angelauser talk:Angela/. 11:38, May 26, 2004

:::::::The no-spaces thing is the old rule. Dashes have at the very least a hair-space these days, and it's a good thing too (because they are not hyphens).

:::::::Should I stop "correcting" ''teh'' for ''the'' before I get specific consensus for it too? No. I should just carry on proofreading, whilst keeping an eye on any new policies that arise.

:::::::If I see old-fashioned spaceless em-dashes (1), or new-fangled spaced en-dashes (2), I'll leave them; but various hyphen hacks will be changed to (3).
:::::::# ''some text—more text''
:::::::# ''some text – more text''
:::::::# ''some text — more text''
:::::::Chameleon/Chameleon 15:52, 26 May 2004

::::::::Why change them to emdashes when there is far more agreement on spaced endashes? The compromise suggested above would be far more sensible than purposefully inserted characters which many users have stated look awful on their browsers. Angela/Angelauser talk:Angela/. 22:56, May 26, 2004

:::::::::Please, no. Using hyphens can be excused as ignorance, but purposefully using en dashes when you know an em dash is called for is a sin against typography. Stevenj/—Steven G. Johnson 04:02, Jun 17, 2004

::Unspaced emdashes is—pretty much—an article of faith among those wikipedians who follow the Chicago Manual of Style. I suspect that on seeing your spaced ems, they'll feel a strong temptation to join 'em up. Faced with a spaced endash, the temptation would be less strong and the effort required greater. Chameleon/Chameleon, think about that one — which do you prefer, unspaced ems or spaced ens? (And while you're at it, take good note of User:Angela/Angela's comments there about the near-consensus we were approaching with spaced endashes: much closer than anything I've ever seen in the four or five times this issue has come up since I've been here.) User:Hajor/–''Hajor'' 23:22, 26 May 2004
:::The consensus I see 'on the ground' is that those Wikipedians who know how to use these HTML entities use spaced em-dashes.
:::As for which I prefer, the main thing I want is hyphens not to be used. Chameleon/Chameleon 00:23, 27 May 2004

UTF-8 can end this debate

Ever since I read the informative (and nit-picky, just my style) Dash (punctuation) article, I've been cheerfully using typographically correct dashes in my edits. And seeing as I can easily enter them as a single character, I've done so. Same goes for curly quotes, apostrophes, etc.: I've been typing them in directly to edit boxes.

I've just discovered that in using these characters I am quite the shill for Microsoft [http://www.cs.tut.fi/~jkorpela/www/windows-chars.html] — the funny thing is, I'm editing with Firefox on a Mac! I suppose that technology has marched on, at least in some operating systems, allowing me to join in on the Windows party of using fancy punctuation marks that aren't part of the ISO-8859-1 standard. I recognize that this is wrong, since the English Wikipedia claims to use the ISO-8859-1 charset. Sorry.

Once you use edit this way, though, it's hard to go back. There's no need for ugly HTML code in the Wiki markup, just clean, correct typography. On a Mac you type in option key combos to get the symbols; on Windows you edit in Word and let it do its creepy thing, then copy and paste; on Linux, I don't know, you write a program or something. Anyway, it's better than the other options.

Of course, the problem is that the ISO-8859 charset does not have these characters. The solution is to move the English wikipedia to UTF-8, like the French and other language Wikipedias have recently done. At that point we can use almost any single-character symbol we want.

As far as size goes, all ASCII characters still only take up one byte when represented in UTF-8. Characters that are not in ASCII but are in ISO-8859-1, like é, take up 2 bytes instead of 1, but I think that would be balanced by the savings from dashes being a 2 byte code instead of 8 bytes of HTML markup, or two ugly bytes when represented as . Non-European characters (used in interwiki links) would take up less space in UTF-8, too. So in the end it might be an overall space savings, if the conversion is thorough.

Thoughts? This change would probably take some community support, as it can't be much fun changing character sets on the largest Wikipedia. Nathan Hamblen/Nathan 17:08, Jun 10, 2004

:I'll second your opinion. Unicode (UTF8 is the easiest) is the way to go. Buckwad/buckwad 03:38, 2004 Jun 11

::I used to enter special characters directly with my keyboard (Microsoft have released a free too that makes it possible to easily design your own keymap; I took the Spanish one and added a few extra characters), but people complained they couldn't view them, so I had to stop. It would be so good if I could start again. — Chameleon/Chameleon 12:39, 16 Jun 2004



Should non-English words be italicized?
Should non-English words be italicized? (Blackfoot music). Hyacinth/Hyacinth 03:39, 14 Jun 2004

: Yes, according to Use other languages sparingly. Nathan Hamblen/Nathan 05:46, Jun 14, 2004


Proposed section for inclusion in the "Dashes" section

On repeated occasions we have been unable to reach agreement on what styles of dashes are acceptable in Wikipedia articles. In an attempt to avoid further petty dash reversion wars (in which I have readily – and pettily – participated in the past), I'd very much like to work for consensus towards including a paragraph along the following lines in the relevant section of the Manual of Style/Manual of Style. It's basically the same as the approach proposed by User:Chameleon above.

:='''Dash guidelines for Wikipedia editors'''=
:In the interests of Wikilove and pending a planned update of the Wikimedia software that will automatically convert groups of hyphens into the appropriate correct en- and em-dashes, editors are encouraged to be accepting of others' dash preferences and not to modify a chosen style arbitrarily, in the same way as they would refrain from arbitrarily changing "artefact" to "artifact" (or vice-versa). The following five dash styles are currently in use on Wikipedia. Please do not change them to reflect ''your'' preference except as indicated below.
:*Tight (unspaced) em-dashes—like this. Entered by means of either — or &#8212.
:*Spaced em-dashes — like this.
:**A very rare subset of this style separates the dash from the surrounding words using Space (punctuation)/hair spaces; since many browsers cannot display hair spaces, these appear on the display as simple tight em-dashes.
:*Spaced en-dashes – like this. Entered by means of either &ndash or &#8211. (Note: an unspaced en-dash may be used to indicate a range of numbers, but unspaced en-dashes should not be used for the parenthetical use under discussion in this Guideline.)
:*A pair of hyphenseither spaced or unspaced like that. These are ugly, but simple to type, and will be taken care of in the future by the automatic conversion feature; indeed, under that future version of the software, they are expected to become the default style. Editors who do not want the bother of keying in HTML entities are free to type their dashes in this fashion. However, subsequent editors are free to convert any double-hyphens they come across to any of the above three types, depending on:
::*personal preference between en-dashes and em-dashes, and
::*how the hyphens were initially entered i.e., a spaced double hyphen may only be converted into a spaced em-dash or a spaced en-dash. The original editor's spacing preference is respected.
:*A single spaced hyphen - actually, there's no real reason to flout the rules of good typesetting in this way. If you come across one of these, please feel free to convert it into your preferred dash style from the above list.

Is there anything truly objectionable in this proposal? May I please include it in the Manual of Style? We've been through all the arguments before, several times; we all have our preferred ways of doing things and, until the software gets updated and we get automatic hyphen conversion, including this guideline in the Manual might help reduce the odd hot-spot of editing heat and, at the same time, improve the professional appearance of our articles. Please, comments, objections, amendments below. Hajor/–''Hajor'' 22:48, 14 Jun 2004

:Being a stickler for good spelling, grammar and punctuation, I obviously agree with this. — Chameleon/Chameleon 12:36, 16 Jun 2004

:That seems quite reasonable to me, other than that it should be acceptable to substitute one's own preferred hyphen style along with one's own preferred spellings over that of the original editor when greatly expanding or greatly changing an article – for example when expanding a stub or doing a complete reworking of an article. But not when making minor changes or additions.

:My own dash style, however, is to use " – " to keep the spaced en-dash (or spaced em-dash) with the preceding word in case of a line break. Jallan/jallan 23:58, 14 Jun 2004

::Yes, I too often find it helpful to use a non-breaking space to keep the dash in the right place. — Chameleon/Chameleon 12:36, 16 Jun 2004

:I feel this is a good addition. I've had it with people not respecting my double hyphens. Meelar/User:Meelar/Meelar User talk:Meelar/(talk) 20:19, 15 Jun 2004

:After my perusual of the 2000 ''Chicago Manual of Style'' yesterday, I'm glad to see such a succinct, simplified, and inclusive policy suggested here. (I had forgotten how many kinds of dashes there were in typesetting!) I'm all for it, even more so with Jallan's amendment. One question: will the new software actually convert the text, or just display two hyphens as an m-dash? And (okay, two questions) will it (or can it be made to) incorporate Jallan's clever use of &nbsp? — Jeffq/Jeff Q 20:34, 15 Jun 2004

::As I understand it, this new software we are hoping for will interpret double hyphens as special wiki-markup for dashes. At that point, it will be possible for someone to write a script to change all the HTML entities (&mdash; and &#8212;) into double hyphens, since these double hyphens will in any case be converted into the appropriate entity (or indeed, directly insert correct dashes as Unicode if we go over to UTF-8) for viewing in the user's browser, and the use of entities will therefore be unnecessary and undesirable — just as we prefer the use of double apostrophes (''''text'''') to the use of HTML (<em>text</em> or <i>text</i>) when italics are required, now that the software supports this.

::N.B. Some argue that triple hyphens - rather than double ones - should be the markup for dashes, because this leaves " free to be the markup for those little en thingies ("–"), in keeping with TEΧ style. Personally, I don't mind either way. — Chameleon/Chameleon 12:36, 16 Jun 2004

::: An excellent proposal. I heartily endorse it. Tannin/Tannin 14:56, 16 Jun 2004

::: PS: As we all know by now, the ndash and the mdash are different things and are used for different purposes: i.e., the software should convert " and &ndash and – to '''–''', and convert "-" and &mdash and — to '''—'''.

::: By the way, as one of the strongest supporters of professional standards in our presentation (i.e., correct punctuation), and in the interests of harmony, I some time ago voluntarily switched from the no-space mdash style to the spaced style, as this seems to incur less wrath from the typographically challenged ... er ... sorry ... I mean as this seems to incur less wrath from my good and helpful Wikibretheren. User:Tannin/Tannin

::: I think the convention of single, double and triple hyphens is a good convention that is easy and unambiguous to convert by software.
::: Ruhrjung/Ruhrjung 02:47, 2004 Jun 17

::::As a LaTeX user, I'm inclined towards and - for en and em dashes myself as well. Regardless of the specific syntax, though, it's important that a simple markup style be present for ''both'' dash types, since both have common and important typographical uses. Of course, ''most'' users will probably not use either one correctly, but it should be easy to correct them and the resulting wikitext should be readable. (PS. Who thinks that a spaced en dash is an acceptable typographical replacement for an em dash, spaced or not???) Stevenj/—Steven G. Johnson 03:48, Jun 17, 2004

:::::Well, I don't like it either but plenty of people here see it as the perfect compromise between us em-dashers and the hyphen mob. — Chameleon/Chameleon 11:13, 17 Jun 2004

:::::''Who thinks that a spaced en dash is an acceptable typographical replacement for an em dash, spaced or not???'' Germans, for one; their typographical convention is "space, en-dash, space" rather than (unspaced) "em-dash". A matter of style and tradition, perhaps.
:::::Oh, and I also wouldn't mind seeing and - being converted into en-dashes and em-dashes, respectively. Pne/pne 12:25, 17 Jun 2004

::::::Yes. I've seen a number of dash recommendation specifications which suggest strongly that either em-dash or space,en-dash,space are equally acceptable. I normally use the second, but without any strong preference for it. Of course when used between quanities with the meaning 'to', an unspaced en-dash is always normal. But there are exceptions, for example "−7–−2" looks far worse than "−7 – −2". Actually both look rather horrible. But in any case I agree that '' for en-dash and '-' for em-dash is the best known easily readible convention that provides the most control. Presumably the new software would first run a pass and change all hard " to "-" and then all " - " to " " before beginning the new interpretation. Jallan/jallan 14:24, 17 Jun 2004

::::::While we're mentioning other languages, French uses spaced em-dashes. Spanish uses em-dashes spaced on one side, just like parentheses or brackets. — Chameleon/Chameleon 15:35, 17 Jun 2004



Dashes (moved from the village pump)
Opinions are being sought regarding a proposal on this thorny issue for inclusion in the Manual of Style. Please see the bottom of Wikipedia talk:Manual of Style/Dashes. Thanks. Hajor/–''Hajor'' 20:15, 15 Jun 2004

:That huge and technical debate (still) seems to say that a software update will convert simple minus signs that everyone has one their keyboard into these funny entities that are not at all intuitive. One of the hugely disagreeable things about this debate is that usability (the essence of wikitext) seems to get repeatedly ignored, and we go round in circles about trivial details. Pcb21/Pcb21/ User_talk:Pcb21/Pete 08:07, 16 Jun 2004

::You don't have to care about typography, but some editors do, and when they fix your typography it will be much easier for them, others, and ''you'' if they can write - instead of &mdash; (—) and instead of &ndash; (–). Stevenj/—Steven G. Johnson 06:07, Jun 17, 2004

:::Right, that would be a software update extending the wiki-syntax. The discussion however seems to be about what combination of &mdash and &ndash to use right now. Pcb21/Pcb21/ User_talk:Pcb21/Pete 18:49, 17 Jun 2004

::::No, it is not about that. Except for one comment no-one has disagreed with Hajor's assertion that "&mdash;", " &ndash; " and even " &mdash; " are all acceptable. The discussion is on whether Hajor's modifications should replace the current statement which in any case already recommends use of &mdash; and &ndash;. There is discussion on what coding should be used which is another matter. Nor does using &emdash; instead of typing " or " - " in any way make any Wikipedia text less usable unless you mean usable in a strict ASCII text environment which barely exists any more. Jallan/jallan 14:56, 18 Jun 2004

Can people be bothered?
I know I'm asking to get lynched posting this here, but can most editors really be bothered to do anything special just to get a dash that looks the "right" length? I certainly can't. I'm all for correct spelling, punctuation and grammar, but I will stop short of fretting about whether my dash is the correct length. I don't perceive any difference when reading between hyphens, m and n dashes and whatever else you have (by that I mean it doesn't alter the way I would read a sentence, unlike other misplaced or misused punctuation). When I'm writing I will use just a single hyphen/minus sign with a space either side - like this - when perhaps I "should" use a dash. It's easy to type and easy to read (both as source and when rendered on the page), and I frankly can't be bothered to go find the style guide every time I type a dash to check the correct way of doing it. Maybe you could blame my laziness on Microsoft Word for autocorrecting all my hyphens to dashes.
Anyway, if you really have nothing more productive to do you're welcome to "correct" all my "misused" hyphens. But I'm sure Wikipedia could benefit from your time more if it were spent improving the content of articles rather than worrying about exactly what the dashes look like.
Tjwood/Tjwood 17:34, 16 Jun 2004

:I would hope that most editors would be in favor of anything that improves the look and readibility of their text, including using dashes rather than hyphens when appropriate. Looking good is important. Your style is ''substandard'' in the technical sense. I don't believe you will find support for use of " - " as a dash in ''any'' normal style sheet. Instead you will find that use severely deprecated. Check also http://www.google.com/search?q=publisher+submission+dash&sourceid=mozilla-search&start=0&start=0&ie=utf-8&oe=utf-8 for specification after specification that in submissions to publishers a normal dash ''must'' be indicated by ". If typing "&emdash;" is too difficult for you, then use " accordingly. That has been the standard keyboard substitute for dash for at least 60 years and is also more efficient to enter than " - ". As you point out, even MS-Word corrects use of " - " as an error. That this is unimportant to you is irrevelant. It is important to others that Wikipedia text looks like it was written and ''edited'' by knowledgeable people. If you use non-standard style of any kind, whether punctuation, spelling, typographical style, grammar ... then you can expect to have your work edited to fit the standards. And you shouldn't expect those who know better to accept a generally deprecated style any more than they would accept a generally deprecated spelling. You seem to be suggesting that because you and some others don't know the rules and conventions and don't want to know the rules and conventions, that the rules and conventions don't matter. It doesn't work that way. Jallan/jallan 01:33, 17 Jun 2004

:TJ, you are right that most editors cannot be bothered (nor should they) to understand the correct usages of the various dash types—the most important thing is to get good writing and even better content. However, ''some'' editors ''will'' want to correct the typography, and it's important that they can do so (a) easily and (b) in a way that still produces readable wiki, so that editors like you are not too confused. Stevenj/—Steven G. Johnson 03:56, Jun 17, 2004

::…and (c) it is important that people must ''not'' convert (my!) correct dashes into hyphens, as has been happening. We need a policy in favour of correct punctuation to defend Wikipedia against silly edit wars. — Chameleon/Chameleon 11:13, 17 Jun 2004

:Sorry, having re-read my first comment I don't think I made it quite clear. When I'm writing text that will be printed - in MS Word for instance - I will use an unspaced double hyphen which Word will automatically convert to an m-dash. Word will also convert a spaced hyphen to a spaced n-dash such as for date ranges etc. But for Web use where I'm limited to plain ASCII or otherwise using special entities, I use a spaced hyphen because it's too much hassle to look up the correct entities to use and the source looks a mess with entities all over the place. A double dashlike thislooks more ugly than a spaced hyphen.
:If Wikipedia gets around to replacing double dashes with the correct entities when the pages are displayed (in a similar manner to Word), but leaving the source code readable, I will happily use them. But until that point I will stick to my "incorrect" spaced hyphens.
:As for printed versions of Wikipedia, surely any possible printed version would need fairly widespread editing first to make things consistent anyway? I've seen different pages on Wikipedia written using completely different styles of layout etc, never mind just dashes. You can get away with it no problem on the web, but when it comes to paper it would look a mess without one fixed style decided by the publisher of the paper version.
:Tjwood/Tjwood 13:14, 17 Jun 2004

:: Actually, you know, I've just looked at what I posted above, and I now think I was wrong when I said "''A double dashlike thislooks more ugly than a spaced hyphen.''" The double dash looks better. You may have just converted me. :-) Tjwood/Tjwood 13:20, 17 Jun 2004

:::It doesn't really matter whether your hyphens are single or double, spaced or tight — just as long as you don't mind the rest of us correcting them to dashes. In the same way, it doesn't matter too much whether you misspell words when you make valuable factual contributions to the encyclopaedia — just as long as you don't mind us cleaning them up afterwards. — Chameleon/Chameleon 15:27, 17 Jun 2004

::::It may matter for ESL readers of Wikipedia. I am not sure, but I think the unspaced dashes, when used as parenthesis, are globally a relatively uncommon phenomena. Or at least, they introduce a resistance in the reading process for readers who aren't used to them. And unspaced dashes are not, let's be honest now, particularly common on the world wide web. I would favor a standard-rendering of spaces before and after m-wide dashes.Ruhrjung/Ruhrjung 18:56, 2004 Jun 17

:::::At this point I'd like to say that dashes are often poor punctuation anyway. Most of the time, the sentence is better with a comma, semi-colon, colon, full stop or parenthesis. — Chameleon/Chameleon 19:28, 17 Jun 2004

Consistency within articles

Having looked at the first ever printed version of Wikipedia (the German de:WikiReader/Internet/WikiReader Internet), I've noticed how awful mixing hyphens and varying length dashes within articles is. It's bad enough between articles, but can we please have a policy of not mixing them. I think consistency is more important than deciding which dashes are used. Angela/Angelauser talk:Angela/. 12:43, 17 Jun 2004

:That's fine, as long as it is not used as an excuse to justify bad punctuation and convert dashes into hyphens. This is really just an extension of the existing policy on spelling: correct mistakes, respect other people's variations within what is correct, try to keep usage consistent within articles even if usage differs between them... — Chameleon/Chameleon 15:27, 17 Jun 2004

:That would be very good, and according to other wikipolicies I believe in (like consistent British or American spelling per article).Ruhrjung/Ruhrjung 18:41, 2004 Jun 17

: Isn't consistency kind of a lost cause on Wikipedia? Which, if you ask me, is the way it probably ought to be?parts of articles suck, other parts are better, but overall it's getting pulled in the right direction. It'll never be complete. ''Wikisux/Wikisux 15:03, 23 Jun 2004 ''

Going live with policy

The proposed text (above) appears to enjoy consensus support, so I'll be inserting it into the Manual of Style shortly. The outstanding issue to include, if anyone fancies adding in the extra language, is consistency within a given article: the guideline given for US/UK spelling conventions is follow the standard set by the first major (non-stub) edit; the logical approach for dashes would be to follow the dash style used by the first editor to include a dash in the article.

Changing to explicit &mdash; and &ndash; entities now (while admittedly not as transparent as we might wish wikitext to be) will also save us a lot of anguish and conflict when the promised automatic conversion comes on line, when presumably we'll switch to using two- and three-hyphen chains in the markup either manually or (as suggested above) by someone running a bot. ("Conflict" in the sense that a lot of people have been typing " in the expectation that they'll autoconvert into emdashes, when from what we saw a few months ago when the function was briefly activated, it'll be three hyphens for an emdash and two for an endash, similar to the TeX convention described above.)

Everybody happy? Hajor/–''Hajor'' 10:53, 28 Jun 2004

:The trouble is that, if the software interprets double hyphens as en-dashes rather than em-dashes, and we have a policy of not changing other people's punctuation unless it's really bad (i.e. they use hyphens for dashes), then we'll have a lot of " that the author wanted to represent em-dashes that will now show up as en-dashes and our policy will specifically forbid us from manually turning these " into "-" (so that — rather than – appears).

:So, the policy ''seems'' fair, but a technical glitch will unfairly favour the use of en-dashes. — Chameleon/Chameleon 11:32, 28 Jun 2004

No the third bullet says subsequent editors are free to change double hyphens to their choice of em-dashes or en-dashes (providing the original writer's spaced/unspaced preference is respected). That was intended precisely to preempt that type of conflict when the auto-conversion comes on line: make them explicit now, and then change them back to either (1) "-" & " or (2) " and " - " following the upgrade. Hajor/–''Hajor'' 12:51, 28 Jun 2004

:No, Hajor, you're not getting me here. I understand that the policy says that ''now, in the short term'' I can turn hyphen hacks into real dashes. That wasn't my concern. I was concerned with how this policy would interact with future software that would automatically display " and "-" in the markup as "–" and "—" on the page.

:At that point in time, the policy you are advocating would obviously be largely obsolete. The preferable way to put correct dashes into text will be, at that point, by using double and triple hyphens in the markup. Using "&mdash;" etc will be deprecated, in the same way that we prefer the use of double inverted commas instead of "<i>". That is to say, we only use HTML to do something when there is no wiki markup for it. It would be a good idea, at that point, to write a script that goes through Wikipedia changing all instances of "&mdash;" to "-". This would not affect the display of the pages in the slightest; it would just make the markup easier to read.

:Follow me so far?

:My point is that most of the policy would be obsolete, except for the part about not changing other people's punctuation (from one of the three acceptable forms of dashes to one of the others). I think people would argue that this part still stands. The problem with this is that, although those em-dashes originally entered into the text as "&mdash;" will now show up in the markup as "-" and on the page as "—", there will also be many instances of dashes '''originally entered into the markup as " with the author's intention being to enter an em-dash, which would at that point show up on the page as "–".

:There would therefore be thousands of en-dashes that the original authors wanted to be longer. And policy would prevent me from changing the markup from " to "-" to make them display as "—" instead of "–". Chameleon/ — Chameleon User:Chameleon/My page/User talk:Chameleon/My talk 19:55, 30 Jun 2004

:::Seems to me this whole mess could be avoided by turning " into em-dashes, as everyone expects, and " - " (hyphen surrounded by spaces) into en-dashes. Reverse compatibility, preserving authors' intent, and intuitive. (The three dashes proposal I find ridiculous.) ''Wikisux/Wikisux 21:19, 30 Jun 2004 ''

I've said this before, but I wonder if I am being understood: we should NOT use dashes that are created in source with characters that the average person cannot understand. The average person has NO IDEA what HTML entities are. Please put this ahead of nit-picking over correct and pretty typography. Agree on a way to represent the various typographical elements with "-", ", "-" and spaces. Tarquin/Tarquin 22:34, 28 Jun 2004

:You've said it before, you've been understood, but the consensus shown above indicates that you're simply not being agreed with. And I think you're insulting the intuitive abilities of the "average person", too. And why on Earth didn't you contribute your views in the two weeks since I posted the proposed text? Please could you try to live with the HTML entity solution until the automatic conversion function is activated (when we are sure to start using nice, transparent strings of hyphens)? Hajor/–''Hajor'' 23:09, 28 Jun 2004

::I'm also suspicious of ''average person'' arguments, especially when almost all Wikipedians (and logically almost 50% of them are below the Wikipedian average) are able to master the other various stylistic features of Wikipedia source code. I think that even the average computer user today does know what HTML entities are. The forms &mdash; and &ndash; are hardly unituitive. But many Wikipedian editors find them ugly and annoying to type and some don't care about typography and some don't want to understand such things, balanced by those who very much care. But no-one wants to see "-" in the final presentation or " in a range, such as 19121914, in the final presentation. Accordingly we can't use such constructs before new interpretation software is applied. Jallan/jallan 00:20, 29 Jun 2004

:And that is in fact what was mandated by the text of the Manual of Style/Manual of Style as it stood when I made the suggestion two weeks ago: "Historically a double hyphen () was used to represent an em dash because on a typewriter the hyphens tend to connect, creating a dash in appearance, but since this is almost never the case in ASCII do not use this." It's still there now, if you care to take a look. And no, it wasn't me who put it there. Hajor/–''Hajor'' 04:21, 30 Jun 2004

I apologize if this has already been brought up, but I'd suggest replacing double hyphens () with em dashes, and single hyphens surrounded by spaces ( - ) with en dashes. This is the http://www.textism.com/tools/textile/ standard (if you've ever used Movable Type) and it makes more sense than any of the other ideas I've seen here. '''edit:''' Oh, I see it ''has'' been brought up already. I throw my weight behind this proposal, then. ''Wikisux/Wikisux 04:40, 30 Jun 2004 ''

Quotation Dash / Horizonatal bar

The Unicode Standard (v4.0) on page 155 in chap. 6 has the Quotation Dash (U+2015 / ―) to be called Horizontal Bar and gives Quotation Dash as an alternate name but officially referrs to it as Horizontal Bar. Of course, calling it Horizontal Bar would conflict with the html tag <HR> which the resulting graphic people seem to want to call horizontal bar too, when its really called a Horizontal Rule by the W3C standard. So yeah... Call U+2015 a quotation dash, a horizontal bar, or both? Keeping with the Unicode Standard for naming specific code characters would be best. Kevin Breitenstein/Kevin Breitenstein 05:43, 3 Jul 2004

Software

I’ve come late into this debate, but I’m pleased with the way it seems to have turned out. I’d like to wish the people writing the conversion software the best of luck. It looks like a wicked problem, but there are some good precedents for a solution.

* http://www.textism.com/tools/textile/, written in PHP, was already mentioned. It does dashes, smart quotes and lots of other character conversions. It even supports a table formatting scheme similar to wikitables. There’s also a http://bradchoate.com/mt-plugins/textile, written in Perl.
* http://daringfireball.net/projects/smartypants/, in Perl, is a simple processor which converts dashes, smart quotes, and ellipses. There's also a http://www.michelf.com/projects/php-smartypants/.
* http://daringfireball.net/projects/markdown/ is a more full-featured text processor by the same author.

It was inspiring to see Wikipedians hash out a solution that respects everyone’s wishes and maintains a ''high standard of typography''.

“Apostrophes and quotation marks, anyone?”

—Mzajac/Michael Z. 06:04, 2004 Aug 27