Translating foreign characters to HTML code

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • gregf@kcls.org

    Translating foreign characters to HTML code

    Is there a way or a program (for windows) that can translate foreign
    characters inot the proper html code? I have a word document with many
    different characters and I really don't want to spend all the time
    editing it with all the html code ie "&#347ci&#2 61". Certainly someone
    must have a program that can do this automatically.

  • David Dorward

    #2
    Re: Translating foreign characters to HTML code

    gregf@kcls.org wrote:
    [color=blue]
    > Is there a way or a program (for windows) that can translate foreign
    > characters inot the proper html code? I have a word document with many
    > different characters and I really don't want to spend all the time
    > editing it with all the html code ie "&#347ci&#2 61". Certainly someone
    > must have a program that can do this automatically.[/color]

    Just save it with a UTF-8 character encoding. Then you don't need to specify
    UTF-8 characters with character references.

    --
    David Dorward <http://blog.dorward.me .uk/> <http://dorward.me.uk/>
    Home is where the ~/.bashrc is

    Comment

    • Lachlan Hunt

      #3
      Re: Translating foreign characters to HTML code

      gregf@kcls.org wrote:[color=blue]
      > Is there a way or a program (for windows) that can translate foreign
      > characters inot the proper html code? I have a word document with many
      > different characters and I really don't want to spend all the time
      > editing it with all the html code ie "&#347ci&#2 61". Certainly someone
      > must have a program that can do this automatically.[/color]

      You need to learn Unicode.


      --
      Lachlan Hunt

      http://GetFirefox.com/ Rediscover the Web
      http://GetThunderbird.com/ Reclaim your Inbox

      Comment

      • Harlan Messinger

        #4
        Re: Translating foreign characters to HTML code

        gregf@kcls.org wrote:[color=blue]
        > Is there a way or a program (for windows) that can translate foreign
        > characters inot the proper html code? I have a word document with many
        > different characters and I really don't want to spend all the time
        > editing it with all the html code ie "&#347ci&#2 61".[/color]

        That's fine, because it wouldn't work anyway. You forgot the semicolons.

        Comment

        • David Håsäther

          #5
          Re: Translating foreign characters to HTML code

          Harlan Messinger <hmessinger.rem ovethis@comcast .net> wrote:
          [color=blue]
          > gregf@kcls.org wrote:[color=green]
          >> Is there a way or a program (for windows) that can translate
          >> foreign characters inot the proper html code? I have a word
          >> document with many different characters and I really don't want
          >> to spend all the time editing it with all the html code ie
          >> "&#347ci&#2 61".[/color]
          >
          > That's fine, because it wouldn't work anyway. You forgot the
          > semicolons.[/color]

          It _will_ work (for HTML, not XHTML) since the parsing for the numeric
          character reference will end with the first non-digit, in this case
          "c".

          --
          David Håsäther

          Comment

          • Alan J. Flavell

            #6
            Re: Translating foreign characters to HTML code

            On Mon, 27 Jun 2005, David Håsäther wrote:

            [ "&#347ci&#2 61". ]
            [color=blue]
            > It _will_ work (for HTML, not XHTML) since the parsing for the numeric
            > character reference will end with the first non-digit, in this case
            > "c".[/color]

            No. Some client agents may choose that fixup, but I challenge you to
            produce any authoritative specification which requires it.

            Comment

            • David Håsäther

              #7
              Re: Translating foreign characters to HTML code

              Alan J. Flavell <flavell@ph.gla .ac.uk> wrote:
              [color=blue]
              > On Mon, 27 Jun 2005, David Håsäther wrote:
              >
              > [ "&#347ci&#2 61". ]
              >[color=green]
              >> It _will_ work (for HTML, not XHTML) since the parsing for the
              >> numeric character reference will end with the first non-digit, in
              >> this case "c".[/color]
              >
              > No. Some client agents may choose that fixup, but I challenge you
              > to produce any authoritative specification which requires it.[/color]

              The SGML Handbook (353:3) says (emphasis mine)

              | The refc or RE can be omitted only if the reference is _not followed
              | by a character that could occur in the reference_, or by a character
              | that could be interpreted as the omitted reference end.

              Looking at the production for a numeric character reference[62.2] (this
              production was introduced in the Web SGML Adaptations TC) we see that
              after CRO it's followed by "character number" (which is one or more
              digit), i.e., nothing other than digits can occur in a numeric
              character reference, and therefore REFC can be omitted.

              Try it in nsgmls too.

              [62.2] numeric character reference =
              cro, character number, reference end

              --
              David Håsäther

              Comment

              • Alan J. Flavell

                #8
                Re: Translating foreign characters to HTML code

                On Mon, 27 Jun 2005, David Håsäther wrote:
                [color=blue]
                > Alan J. Flavell <flavell@ph.gla .ac.uk> wrote:
                >[color=green]
                > > On Mon, 27 Jun 2005, David Håsäther wrote:
                > >
                > > [ "&#347ci&#2 61". ]
                > >[color=darkred]
                > >> It _will_ work (for HTML, not XHTML) since the parsing for the
                > >> numeric character reference will end with the first non-digit, in
                > >> this case "c".[/color]
                > >
                > > No. Some client agents may choose that fixup, but I challenge you
                > > to produce any authoritative specification which requires it.[/color]
                >
                > The SGML Handbook (353:3) says (emphasis mine)[/color]

                [...]

                OK, I have to admit that this SGML detail was not known to me, and it
                appears you are correct about that.

                However, the HTML specification (parts displayed in black)
                does not include this option - see



                Indeed there is a note (in green) stating:

                Note. In SGML, it is possible to eliminate the final ";" after a
                character reference in some cases (e.g., at a line break or
                immediately before a tag). In other circumstances it may not be
                eliminated (e.g., in the middle of a word). We strongly suggest using
                the ";" in all cases to avoid problems with user agents that require
                this character to be present.

                And all of their recipes and examples include the terminating
                semicolon.

                So although I concede that you may be technically correct on the SGML
                front, I think it's overstating the case to claim that it's true for
                HTML.

                (Modulo the usual arguments about "the HTML specification purports to
                exclude some features of SGML which SGML does not permit to be
                excluded.)

                thanks.

                Comment

                • dwight.stegall@gmail.com

                  #9
                  Re: Translating foreign characters to HTML code

                  Try this acii converter.
                  Get a new domain name for your startup. Quick and professional service. Seamless domain transfers.


                  Comment

                  Working...