pdflatex, breakurl and unicode characters











up vote
7
down vote

favorite












I am trying to write an URL using breakurl, that includes an accented character. I am trying to use pdflatex with utf8 inputenc (also tried utf8x). But I can't get a decent result.



My best result is presented by this minimal working example:



documentclass{article}

usepackage[utf8x]{inputenc}
usepackage{hyperref}
usepackage{url}

begin{document}
url{detokenize{http://coração.net}}

end{document}


The link is created correctly, but the text shown in the PDF is not the correct one (misses the non ascii characters).



Suggestions are welcome.










share|improve this question














bumped to the homepage by Community 12 mins ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.















  • detokenize will completely break pdflatex's handling of utf-8.
    – David Carlisle
    Dec 18 '17 at 16:20










  • Sure, but removing it doesn't help too.
    – Alberto
    Dec 18 '17 at 16:26










  • Why do you need that? I don't think url with non-ascii characters are supported anywhere (feel free to correct me about this). Also there is no need to load url if you load hyperref.
    – Skillmon
    Dec 18 '17 at 16:33






  • 6




    href{detokenize{http://coração.net}}{texttt{http://coração.net}}
    – user91669
    Dec 18 '17 at 16:36






  • 3




    @Skillmon unfortunately unicode domains are out there, and they are a big danger for phishing, as there are ways to make similar looking urls pointing to different places. Look here: wordfence.com/blog/2017/04/chrome-firefox-unicode-phishing
    – Alberto
    Dec 18 '17 at 16:41















up vote
7
down vote

favorite












I am trying to write an URL using breakurl, that includes an accented character. I am trying to use pdflatex with utf8 inputenc (also tried utf8x). But I can't get a decent result.



My best result is presented by this minimal working example:



documentclass{article}

usepackage[utf8x]{inputenc}
usepackage{hyperref}
usepackage{url}

begin{document}
url{detokenize{http://coração.net}}

end{document}


The link is created correctly, but the text shown in the PDF is not the correct one (misses the non ascii characters).



Suggestions are welcome.










share|improve this question














bumped to the homepage by Community 12 mins ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.















  • detokenize will completely break pdflatex's handling of utf-8.
    – David Carlisle
    Dec 18 '17 at 16:20










  • Sure, but removing it doesn't help too.
    – Alberto
    Dec 18 '17 at 16:26










  • Why do you need that? I don't think url with non-ascii characters are supported anywhere (feel free to correct me about this). Also there is no need to load url if you load hyperref.
    – Skillmon
    Dec 18 '17 at 16:33






  • 6




    href{detokenize{http://coração.net}}{texttt{http://coração.net}}
    – user91669
    Dec 18 '17 at 16:36






  • 3




    @Skillmon unfortunately unicode domains are out there, and they are a big danger for phishing, as there are ways to make similar looking urls pointing to different places. Look here: wordfence.com/blog/2017/04/chrome-firefox-unicode-phishing
    – Alberto
    Dec 18 '17 at 16:41













up vote
7
down vote

favorite









up vote
7
down vote

favorite











I am trying to write an URL using breakurl, that includes an accented character. I am trying to use pdflatex with utf8 inputenc (also tried utf8x). But I can't get a decent result.



My best result is presented by this minimal working example:



documentclass{article}

usepackage[utf8x]{inputenc}
usepackage{hyperref}
usepackage{url}

begin{document}
url{detokenize{http://coração.net}}

end{document}


The link is created correctly, but the text shown in the PDF is not the correct one (misses the non ascii characters).



Suggestions are welcome.










share|improve this question













I am trying to write an URL using breakurl, that includes an accented character. I am trying to use pdflatex with utf8 inputenc (also tried utf8x). But I can't get a decent result.



My best result is presented by this minimal working example:



documentclass{article}

usepackage[utf8x]{inputenc}
usepackage{hyperref}
usepackage{url}

begin{document}
url{detokenize{http://coração.net}}

end{document}


The link is created correctly, but the text shown in the PDF is not the correct one (misses the non ascii characters).



Suggestions are welcome.







pdftex unicode urls






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Dec 18 '17 at 16:12









Alberto

16411




16411





bumped to the homepage by Community 12 mins ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.







bumped to the homepage by Community 12 mins ago


This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.














  • detokenize will completely break pdflatex's handling of utf-8.
    – David Carlisle
    Dec 18 '17 at 16:20










  • Sure, but removing it doesn't help too.
    – Alberto
    Dec 18 '17 at 16:26










  • Why do you need that? I don't think url with non-ascii characters are supported anywhere (feel free to correct me about this). Also there is no need to load url if you load hyperref.
    – Skillmon
    Dec 18 '17 at 16:33






  • 6




    href{detokenize{http://coração.net}}{texttt{http://coração.net}}
    – user91669
    Dec 18 '17 at 16:36






  • 3




    @Skillmon unfortunately unicode domains are out there, and they are a big danger for phishing, as there are ways to make similar looking urls pointing to different places. Look here: wordfence.com/blog/2017/04/chrome-firefox-unicode-phishing
    – Alberto
    Dec 18 '17 at 16:41


















  • detokenize will completely break pdflatex's handling of utf-8.
    – David Carlisle
    Dec 18 '17 at 16:20










  • Sure, but removing it doesn't help too.
    – Alberto
    Dec 18 '17 at 16:26










  • Why do you need that? I don't think url with non-ascii characters are supported anywhere (feel free to correct me about this). Also there is no need to load url if you load hyperref.
    – Skillmon
    Dec 18 '17 at 16:33






  • 6




    href{detokenize{http://coração.net}}{texttt{http://coração.net}}
    – user91669
    Dec 18 '17 at 16:36






  • 3




    @Skillmon unfortunately unicode domains are out there, and they are a big danger for phishing, as there are ways to make similar looking urls pointing to different places. Look here: wordfence.com/blog/2017/04/chrome-firefox-unicode-phishing
    – Alberto
    Dec 18 '17 at 16:41
















detokenize will completely break pdflatex's handling of utf-8.
– David Carlisle
Dec 18 '17 at 16:20




detokenize will completely break pdflatex's handling of utf-8.
– David Carlisle
Dec 18 '17 at 16:20












Sure, but removing it doesn't help too.
– Alberto
Dec 18 '17 at 16:26




Sure, but removing it doesn't help too.
– Alberto
Dec 18 '17 at 16:26












Why do you need that? I don't think url with non-ascii characters are supported anywhere (feel free to correct me about this). Also there is no need to load url if you load hyperref.
– Skillmon
Dec 18 '17 at 16:33




Why do you need that? I don't think url with non-ascii characters are supported anywhere (feel free to correct me about this). Also there is no need to load url if you load hyperref.
– Skillmon
Dec 18 '17 at 16:33




6




6




href{detokenize{http://coração.net}}{texttt{http://coração.net}}
– user91669
Dec 18 '17 at 16:36




href{detokenize{http://coração.net}}{texttt{http://coração.net}}
– user91669
Dec 18 '17 at 16:36




3




3




@Skillmon unfortunately unicode domains are out there, and they are a big danger for phishing, as there are ways to make similar looking urls pointing to different places. Look here: wordfence.com/blog/2017/04/chrome-firefox-unicode-phishing
– Alberto
Dec 18 '17 at 16:41




@Skillmon unfortunately unicode domains are out there, and they are a big danger for phishing, as there are ways to make similar looking urls pointing to different places. Look here: wordfence.com/blog/2017/04/chrome-firefox-unicode-phishing
– Alberto
Dec 18 '17 at 16:41










1 Answer
1






active

oldest

votes

















up vote
0
down vote













The following code works with xelatex:



documentclass{article}
usepackage{fontspec,hyperref}
begin{document}
url{http://coração.net}
end{document}





share|improve this answer

















  • 2




    Yep, unfortunately I needed to use pdflatex.
    – Alberto
    Dec 18 '17 at 17:23











Your Answer








StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "85"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2ftex.stackexchange.com%2fquestions%2f406762%2fpdflatex-breakurl-and-unicode-characters%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
0
down vote













The following code works with xelatex:



documentclass{article}
usepackage{fontspec,hyperref}
begin{document}
url{http://coração.net}
end{document}





share|improve this answer

















  • 2




    Yep, unfortunately I needed to use pdflatex.
    – Alberto
    Dec 18 '17 at 17:23















up vote
0
down vote













The following code works with xelatex:



documentclass{article}
usepackage{fontspec,hyperref}
begin{document}
url{http://coração.net}
end{document}





share|improve this answer

















  • 2




    Yep, unfortunately I needed to use pdflatex.
    – Alberto
    Dec 18 '17 at 17:23













up vote
0
down vote










up vote
0
down vote









The following code works with xelatex:



documentclass{article}
usepackage{fontspec,hyperref}
begin{document}
url{http://coração.net}
end{document}





share|improve this answer












The following code works with xelatex:



documentclass{article}
usepackage{fontspec,hyperref}
begin{document}
url{http://coração.net}
end{document}






share|improve this answer












share|improve this answer



share|improve this answer










answered Dec 18 '17 at 17:12







user91669















  • 2




    Yep, unfortunately I needed to use pdflatex.
    – Alberto
    Dec 18 '17 at 17:23














  • 2




    Yep, unfortunately I needed to use pdflatex.
    – Alberto
    Dec 18 '17 at 17:23








2




2




Yep, unfortunately I needed to use pdflatex.
– Alberto
Dec 18 '17 at 17:23




Yep, unfortunately I needed to use pdflatex.
– Alberto
Dec 18 '17 at 17:23


















draft saved

draft discarded




















































Thanks for contributing an answer to TeX - LaTeX Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.





Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


Please pay close attention to the following guidance:


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2ftex.stackexchange.com%2fquestions%2f406762%2fpdflatex-breakurl-and-unicode-characters%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Accessing regular linux commands in Huawei's Dopra Linux

Can't connect RFCOMM socket: Host is down

Kernel panic - not syncing: Fatal Exception in Interrupt