Design Review #301

#301: hrefTranslate attribute

Visit on Github.

Opened Aug 23, 2018

Bonjour TAG,

I'm requesting a TAG review of:

Name: hrefTranslate attribute addition
Specification URL: HTML WHATWG pull request
Explainer, Requirements Doc, or Example code: hrefTranslate
Tests: None currently
Primary contacts: @dtapuska

Further details (optional):

Relevant time constraints or deadlines: [please provide]
I have read and filled out the Self-Review Questionnare on Security and Privacy. The assessment is here.
I have reviewed the TAG's API Design Principles

You should also know that...

There has been some debate in whatwg/html#2945 specifically if this feature is useful or creates new problematic scenarios. We believe this feature has useful benefits in surfacing content to users.

We'd prefer the TAG provide feedback as (please select one):

open issues in our Github repo for each point of feedback
open a single issue in our Github repo for the entire review
leave review feedback as a comment in this issue and @-notify [github usernames]

Discussions

Comment by @torgo Oct 30, 2018 (See Github)

We're trying to make some progress on this at our Paris f2f. Unfortunately we haven't made progress on this review. @dtapuska can you send us an explainer reference? Please see our updated explainer explainer.

Comment by @dtapuska Oct 30, 2018 (See Github)

@torgo The explainer is indicated in the first comment and is here: https://github.com/dtapuska/html-translate

Comment by @kenchris Oct 31, 2018 (See Github)

According to https://w3ctag.github.io/design-principles/#casing-rules attributes should be lowercased and concatenated

Comment by @dbaron Oct 31, 2018 (See Github)

I agree that I'm not a fan of reusing the lang attribute for any of this -- the lang attribute is describing the language of the contents of the element, which in this case is the text of the link.

Assuming that the feature is desirable, both options 2 and 3 seem plausible, and it also seems like they could be combined.

I suppose that what's less clear to me here is why this should be something that the page chooses rather than the UA -- the explainer doesn't really make that clear to me.

Comment by @kenchris Oct 31, 2018 (See Github)

would hreftranslate without value do anything? Like for instance translate to the default UA language?

Comment by @kenchris Oct 31, 2018 (See Github)

I would like to understand what are some of the advantages of this? That the browser/CDN can cache translations or that the pages can be pretranslated to minimize bandwidth usage/caching and thus improving the time to interactive? Could the explainer talk more about that?

Comment by @hadleybeeman Oct 31, 2018 (See Github)

Hi all. We've discussed this at our TAG face-to-face in Paris.

We are mainly wondering: when do you imagine this being a decision that you'd want to give to a referring page, and not to the user agent? Currently, if I prefer viewing web pages in French, I will express that preference with an HTTP header (like Accept-Language, for example) and the server will send me French content or translate for me where it can.

As a user, I expect the UA to manage these preferences and transactions for me. When would it be useful to let the referring site make that decision for me?

Could you provide some more real world examples of where you see this user need cropping up? Thanks so much!

Comment by @dtapuska Oct 31, 2018 (See Github)

It is most beneficial when:

The landing page wants to present text in alternate languages and wants to present a machine translated page for those languages. Think how the main wikipedia site has a bunch of text for the language translated into the native language. So even a non-English speaker lands on an English site they can identify the text that indicates their native language. And the site which is comfortable with machine translation can cause the page to actually be translated in that language.
The explainer describes this scenario where searching for text in a language different than any of your language preferences set may return a page that should be translated for the user. ie. Think about "Translate this page" on Google Search; it currently uses server side translation and for better fidelity (because of the use of javascript to generate pages) it could use client side translation.

Comment by @dbaron Oct 31, 2018 (See Github)

One other concern here is that it seems to assume that client side translation is available in all browsers, when that's not actually the case, and I don't think there's a reasonable path to it being the case. It's not clear to me what the recommended fallback is, or how it's supposed to fall back to it.

Comment by @dtapuska Oct 31, 2018 (See Github)

Sorry I should place some things that I believe are common knowledge in the explainer. Since this is a new attribute it is easy to feature detect. (ie; HTMLAnchorElement.prototype.hasOwnProperty('hrefTranslate');)

Comment by @dbaron Oct 31, 2018 (See Github)

Sure, it can be feature detected... but what's the expectation of what pages do when it's not there? Seems like it might be worth talking about that.

Comment by @hadleybeeman Oct 31, 2018 (See Github)

Thanks for the response, @dtapuska. I'll admit, I'm still a little concerned.

The web fundamentally works by the user expressing their preferences through a user agent. (I tell my browser that I want sites in French.) The user agent then negotiates with sites accordingly.

If I (or my UA) has set some preferences with a search engine, for example, ("I'd like my Google results to be displayed in French") -- that is still a relationship I have with that site. The same origin policy is another expression of this mentality; I have told this site something about me that I don't expect it would (or could) share with another.

To explicitly expect this search engine to tell the site I'm clicking on that I'm probably going to want the site in French -- isn't how the web works. It's a violation of my deal with the search engine (since it is leaking information about me) and prevents my user agent from doing its job -- which is to tell that second site how I'd like the content.

Have I misunderstood what you're trying to do here?

Comment by @hadleybeeman Jan 8, 2019 (See Github)

Hi @dtapuska, and happy new year! Just checking on this... any thoughts on my questions above?

Comment by @torgo Feb 5, 2019 (See Github)

Ping @dtapuska ... is this work still live, or should we close this issue?

Comment by @dtapuska Feb 5, 2019 (See Github)

It is alive. We are conducting an experiment in Chrome with exploring this feature.

Let us set aside the fact that a user with a search engine and if the user is signed in might have language preferences as this complicates things and this isn't what I am talking about.

I'm talking about the ability of a page to provide a statically linked translation service.

eg. I'm preparing a page that is in French but I want to link to an article in German knowing my primary audience is French I might provide a href to something like "https://www.microsofttranslator.com/bv.aspx?from=&to=fr&a=https://www.sueddeutsche.de/"

This creates poor server side translations. If I can detect that the user agent supports client side translation I can simply set a hint of the desired language. And if it doesn't match the user settings when the user navigates to it it would translate it.

Discussed Mar 5, 2019 (See Github)

dka: where are we?

dtapuska: Search sometimes redirects pages to server-side translation. That misses out on a whole lot of dynmaic pages. Client-side translation exists in some browsers; produces better experience. Want to avoid server-side translation if we can. Can detect based on prototype chain. Was some discussion in the github issue about whether we should be doing this, whether you know the locale. The server is returning the page, and putting text there and directing you to the server-side translation, so user settings don't come into pplay. The server is controlling things. So I don't think we're taking anything away from the user; allowing use cases that weren't possible before.

hadley: what concerned me when we last looked is that the model of the web is that user gets to control the experience through teh user-agent. Thus the client-server relationship is between the user and the server. This looks like puts a different server in control and takes power away from the user. Specific example?

dtapuska: Example was searching in Google search, searching for a german news site. Ends up with translate attribute that redirects to translate.google.com with url that you were loading. The page you're interacting with is responsible for the next links; that's what we're trying to cnotrol. What's happening with server side translation is that half the page isn't translated (javascript).

dka: the underlying assumption is that the referring site knows something the browser doesn't know? Doesn't the browser send header with its language preferences? What if the server gets two signals that are conflicting? Is the current client-side signal not adequate?

dtapuska: The client-side signal won't translate. What we're after is that search can return pages in any language. If you think about the web from the point of view of non-English, when much of the web is in English. We're trying to enhance the web for that opportunity. If a language is in your accept-language, the server's probably not going to set the hrefTranslate attribute.

hadley: I understand you want the UX in the translate mode to work smoothly; still confused about why to do that in the page rather than in the browser.

dtapuska: in the referrer page, would use hrefTranslate and original href instead of setting href to translate.bing.com?url=... . And translation then works better because it's client-side rather than server-side.

hadley: wouldn't make more sense to do that in http rather than in the referring page?

tess: e.g., browser UI that begins the auto-translating browsing mode.

dka: I still don't understand why accept-language header isn't sufficient. I click link to Der Spiegel, why doesn't Der Spiegel ask me to translate the content to English through its use of Google Translate?

dtapuska: You could handle those pages using accept-language, but a lot of the web doesn't have the resources to deliver that content in different languages.

hadley: Would it make more sense... if you want it to all run in the client... to build a version of translate into the browser?

dtapuska: That's what this is -- that's the client-side translation that occurs.

... it allows the referrer page to serve pages that don't match the language that you can read.

tess: given that the idea is having a browser feature that does translation, presumably with UI in the browser to activate this mode... why does there have to be any difference in the web content at all? Why do I need to cause the translation to happen in my page content? Why does content have to change at all?

dtapuska: mainly, problems with mixed content modes. If you combine language, you get into wierd scenarios where you don't know if you should translate. So we took the approach of being explicit.

tess: Isn't the user turning the feature on the explicit signal?

dtapuska: I'd assume the feature would be on be default. That's the way it works in Chrome today -- translate is on by default but opt-out.

hadley: Since many users aren't native English speakers or don't want content in English, would you imagine this being included on every href since we don't know what users will need translated?

dtapuska: What search team was after was how to make translate attribute work better. I don't think it's applicable to every single href. Not a magical attribute you'd add everywhere.

hadley: It does a bit... it makes it feel it's trying to solve a problem for a particular kind of user that's not about the web itself but about how they expeirence the web -- seems to be the kind of thing that lives in the user agent.

dtapuksa: This attribute lies between the HTML and the user-agent. The user agent looks at this and decides to do something different... like an accessibility thing where you could do something different at the user agent level.

hadley: not more efficient to just have the UA translate things if the user wants them translated?

dtapuska: comes down to that we could get it wrong sometimes

tess: the browser can get it wrong, so sites that link to other sites should change their content to work around it?

tess: so today if your browser backend for autottranslation... what if one browser has a good result and the other browser doesn't. How do you write the page?

dka: One thing I'd like to see is how this works with any translation service, any search provider, and any browser. I'm hearing a lot about how this works with Google ones.

dtapuska: The attribute is feature-detectable. But Tess's problem of whether one browser returns a good result or not is certainly a problem. The referrer page has to pick a translation service...

tess: But with the attribute the referrer page doesn't choose a translation service, that's now a browser decision.

dtapuksa: I don't have a good answer there.

... Previously, if the page wanted translation, they were choosing a translation service that would be good. I'd imagine if it were bad in some browsers they'd have to use hrefTranslate in only some browsers.

tess: Since it's an attribute it's feature detectible... so if writing HTML and most browsers don't have this feature... if I'm writing HTML with translate.google.com?url=... for browsers that don't support hrefTranslate, won't the hrefTranslate then translate it twice if you add it. The fallback story is strange; you have to do content-rewriting.

dtapuksa: The primary target of this is machine-generated pages, not handwritten HTML.

kenneth: how does this work when sharing URLs? This seems like it would be very confusing to people.

dtapuksa: Hopefully in those cases the translation engine pops up and solves it. This is mainly avoiding the roadblock of having users click "I want to translate this page"

Kenneth: so if I'm on a page that's already translated I probably want those translated as well?

dka: Isn't there a social value to the user knowing that the text has been translated and wasn't written as-is by the author? Translation is imperfect. Can get the gist. If I found myself reading broken English and didn't know the pgage hade been translated, is that the experience I want? Shouldn't the user know it's been translate.

dtapuksa: I think there's a user-agent point of view -- UA could indicate that pages are translated in all sorts of fashions. It's up to the UA to provide that signal. That said, people may key off URL being translate.....com?url=...

dka: Feel like this has been a good discussion; sorry if you're hearing negativity.

dtapuksa: This is a proposal, what we wanted to do was experiment with this and determine whether it gets better engagement. That's the state of the world for this feature. Trying to run experiments with this attribute. See if it's a good/bad idea. Obviously want to get TAG feedback on the approach we prototyped.

dka: Would also be good to get feedback from other search providers and other translation service providers.

dtapuska: Do have feedback from Microsoft, not from others. If you have contacts, happy to reach out.

dka: Leave this open in a monitoring sense, and hear back about the experiment?

dtapuska: I'll ping the issue when we have experiment feedback.

Comment by @dbaron Mar 6, 2019 (See Github)

We discussed this in our teleconference today.

Comment by @ylafon Mar 7, 2019 (See Github)

See also https://github.com/whatwg/html/issues/2945 for further discussions and input from i18n.

Discussed Apr 17, 2019 (See Github)

Hadley: We had dtapuska on a call, and he still doesn't seem to agree with our suggestions about how the web platform works. Not sure what we should do next?

David: My memory of that thread is that there was also a fair bit of misunderstanding in the issue.

Hadley: Which misunderstandings are you thinking of? If we can, that would be useful

Torgo: Let's look at it next week. If we still aren't happy, having cleared up those miscommunications... I'll add David to it, since he's going to do that

Comment by @torgo Apr 17, 2019 (See Github)

We discussed on the TAG call today would like to bring closure to this issue. @dbaron and @hadleybeeman will try to write up some of their views of where we stand on this topic.

Comment by @dbaron Apr 17, 2019 (See Github)

So my summary of where we are I think is roughly this:

@dtapuska's use case largely starts from the idea that client-side translation provides a better user-experience than server-side translation because client-side translation can work well on a much wider range of pages (using the range of web APIs) rather than only on largely-static pages
today only some browsers have client-side translation, and the options for fallback that are available today to provide one experience for browsers with client-side translation and one for those without aren't very good. In particular, authors can offer server-side translation everywhere, or they can offer client-side translation in browsers that have it falling back to no translation without client-side translation, but they can't easily offer client-side translation in browsers that have it falling back to server-side translation for browsers without. I see the ability to provide that fallback as the core use case that this spec enables, although I'm not sure if @dtapuska would agree with that characterization.
@hadleybeeman points out that another significant advantage of client-side translation is that the client may have a better idea of the user's desired language than the server, and that this is well within the space of choices that are normally chosen by the client rather than the server. I believe most or all of the TAG agrees with this.

The solution proposed in this issue has two characteristics that the TAG is concerned about (though we've had much more discussion of the first; I'm not sure how widely the second concern is shared):

has the server choose the language for client-side translation
addresses fallback using a javascript-only mechanism, by requiring detection of hrefTranslate support and replacement of the href

I believe there exist alternative solutions that would avoid one or both of these problems while still addressing the use case of allowing sites to depend on client-side translation but fall back to server-side translation in implementations that don't support client-side translation.

For example:

a solution that provides only for javascript feature-detection of client-side translation would fix the first problem though not the second
a solution that provides a second href-like attribute that would be used (when present) instead of href by browsers that support client-side translation would address both of them, although with the disadvantage that in the case when it is used (i.e., the case where the author wants fallback from client-side translation to server-side translation) the more "semantic" href is in an attribute other than href

I'd note that the TAG's preference for what language is used makes the "fallback" here less exact of a fallback (since it may be falling back to displaying a different language when using server-side translation than it would when using client-side translation). However, I think we feel that that is the correct thing to do in the service of providing the best experience available for each case.

Comment by @dtapuska Apr 17, 2019 (See Github)

Providing an alternate href attribute for clients that support client side translation is ok but I fear that it might not be very web compatible. If there are libraries, accessibility plugins or extensions already looking for the href attribute to where the destination will be this software would need to be adjusted to determine the real href the user agent is going to use. Since this is a user setting that software would need to detect whether or not client side translation is enabled.

I think there are lots of scenarios where an alternate href might have issues: What URI do we show for the status bar when hovering? What does Copy Link Address do? Do all CSS attribute selector rules need to be duplicated, etc.

This feature is about a very niche case but with lots of impact for non-english consumers of content. A large percentage of search queries are non-english queries with English locale set. The English locale setting is a status symbol in these markets.

An alternate URI method still misses a property that we need, the ability for the server to influence the next page's language translation. And I believe this is the core of the issue that TAG sees in this request. My position is that the page already has an influence of what the next URI is (whatever language it wants it in) because it is explicitly linking to it. And today search engines might link to server side translation engines. That this proposal is just acknowledging client side translation exists and extending its functionality so that the previous page provides some context to the next page.

Comment by @hadleybeeman May 22, 2019 (See Github)

We have looked at this in our face to face in Iceland and are closing this issue.

This looks like a feature for a single browser with a single use case. We don't see evidence of this being worked on with any kind of multi-stakeholder interest.

This proposed feature is also underpinned by an assumption that doesn't fit with our understanding of the web, since this feature would let the referring site express a preference that we think sits with the user, in the user agent.

We have discussed our concerns on a call with @dtapuska, and the conversation isn't progressing. We are therefore closing this issue. Please open another one if this spec changes substantially or you still have outstanding areas where we can help.

Comment by @dtapuska May 22, 2019 (See Github)

For my future reference so it doesn't look like I did inactivity here.

As per meeting notes I indicated I will ping the issue when we have experimental feedback. The experiment was a little slow starting and I expect we have this in about two months time.

Comment by @dtapuska Sep 19, 2019 (See Github)

So I'd like to reopen this issue because Chrome wishes to proceed with an Intent to Ship on this issue. We'e taken the feedback and incorporated it into our implementation. Highlights are we've surfaced to the user that an automatic translation occurs, added prompts to translate, and we carry forward what the current translation of the page is to outbound links linked from the current page. These are very much as a result of the TAG feedback and the experimentation we've done so we thank everyone for their constructive comments on this proposal. Chromium believes that significant impact comes from changes such as this one to help under-serviced groups, bringing the web forward for all.

Another thing is that Google Search has now begun serving the translated web to users. An example can be found in the explainer. I hope that this further clarifies that Chromium holds the principles of the open, diverse and customizable web.

Discussed Oct 2, 2019 (See Github)

Hadley: I had another look at this. Looks like dtapuska opened it again... didn't know that was possible.

Peter: I re-opened it after he asked.

Hadley: Looks like he has taken feedback on board, but hasn't resolved my fundamental issue that the UA doesn't determine the language of the page...

... User agent is there for the user to express their preference to a website. Breaks that model for the preference to come from the referring website. We already have lots of discussion of this on the issue.

Peter: Ironic that Google.com is notoriously bad at selecting the language for the user when you change locations, on google.com anyway.

Hadley: So what should we do? David?

David: Maybe we should figure out what has changed with the request. I agree with your concern about the outgoing page choosing the language.

Hadley: Let's bump this week.. you're right, we should look at the changes that have been made. We still have the same underlying concern, though, and it doesn't seem to have been addressed.

David: It's not clear what changes have been made; the PR hasn't changed as far as I can tell.

Hadley: Great, we can just ask what has changed.

Comment by @dbaron Oct 2, 2019 (See Github)

We're curious if there are any spec changes as a result of the feedback being incorporated; it doesn't look as though the PR has changed at all. Though there were substantive changes to the explainer in https://github.com/dtapuska/html-translate/commit/8e2293aa713f5735af95e6aed09e0a6352e9f6c5 and a smaller followup commit to fix it up.

Comment by @dbaron Oct 2, 2019 (See Github)

(That said, rereading the explainer, it doesn't feel particularly different than what it was before, other than having a lot of examples.)

Comment by @dtapuska Oct 2, 2019 (See Github)

@dbaron We've highlighted the problems the web has today more concretely. Calling out examples of how it is broken, specifically delivering content that doesn't have attribution to the original origin.

Majortiy of the changes don't land in a specification perhaps but in the implementation of the UX for the User Agent. The UA is in control of what language the page is in. There are now prompts to confirm that the user wishes to translate the target page, and when a translation occurs. Likewise the translation engine keeps the target language for outbound links off of a translated page (this was raised as an issue and we addressed it yet it doesn't really end up much in the UX of a page).

The feedback was to make the browser smarter as opposed to a page dictating what it should be in, and this is the case now. The attribute only serves as a possible hint with more UX apparent to the user.

Comment by @chrishtr Oct 3, 2019 (See Github)

Also, I'd like to add that I don't think the "single browser, single use case" conclusion is really the right one to make. Let me try to summarize why inline (though it's also in the new explainer):

Humans speak many different languages, and it is not feasible for site authors to anticipate all of the ways people who don't speak their language might interact with their site. Also, almost all users don't understand browser settings very well, and are often influenced by default settings or other factors when choosing a UI language. These problems are real, worth solving, and ones that a combination of sites and User Agent smarts can address.

If a referring site has a pretty good idea of in what language the user would want to load another site after navigation -- e.g because the user regularly visits that site -- and the user trusts the referring site site, it is a good thing that the site passes this recommendation on to the User Agent. The User Agent can then do what it thinks is best for the user at that point, which could include (a) ignoring it, (b) providing a UI to the user to give them a choice of what to do, or (c) auto-translating the site via whatever technology is desired, which could include server or client-side features.

The hint is the hrefTranslate attribute.

Comment by @alice Oct 3, 2019 (See Github)

@chrishtr

... it's also in the new explainer

Would you mind clarifying which specific part of the new explainer you're referring to, just for easy access?

Also, almost all users don't understand browser settings very well, and are often influenced by default settings or other factors when choosing a UI language.

Could you elaborate on why a UA couldn't solve this problem by making those settings easier to access (for example, prompting a user who frequently searches via the address bar in language A but has their UA language set as language B to set their UA language to A)? Do the other factors alluded to rule out a purely UA-based solution?

Comment by @chrishtr Oct 3, 2019 (See Github)

Would you mind clarifying which specific part of the new explainer you're referring to, just for easy access?

Sure thing. The list of Pros of the API includes the points I'm making here, plus more.

The Problem Statement section also goes into more detail about the problems caused by sites trying to help out the user with translation without any UA coordination. In particular, it's not just rendering quality.

Could you elaborate on why a UA couldn't solve this problem by making those settings easier to access (for example, prompting a user who frequently searches via the address bar in language A but has their UA language set as language B to set their UA language to A)? Do the other factors alluded to rule out a purely UA-based solution?

To your first question:

If a user frequently searches via the address bar, then the UA could for sure learn something about their preferences. But almost all interaction with sites, other than the very special case of search, happens in the page itself and the UA (by design) has a limited idea of what is going on.

To your second:

A hypothetical UA that reads all of the content of your websites and observes what you do could build machine learning models that predict your language preferences. To some extent browsers today attempt to do that, by observing the detected language of visited pages and keeping statistics.

But to rely on the UA and only the UA for this is much too limiting, and actually works against open-ness, decentralization and choice. There is room for smart sites to offer suggestions to the browser to help the user experience, with the user's permission.

Comment by @alice Oct 3, 2019 (See Github)

@chrishtr Thank you for the response!

But almost all interaction with sites, other than the very special case of search, happens in the page itself and the UA (by design) has a limited idea of what is going on.

It would be nice to have an example in the explainer of how this would be used by a non-search website, in that case - the Motivation section seems to be based exclusively around search.

(Also, given the explainer is now longer than a single page, a table of contents would be helpful!)

Comment by @chrishtr Oct 3, 2019 (See Github)

It would be nice to have an example in the explainer of how this would be used by a non-search website, in that case - the Motivation section seems to be based exclusively around search.

Sure. An example: Facebook routinely surfaces web page links users might be interested in exploring. It would set hrefTranslate if its relationship with the user and understanding of the web page indicated it would be a good idea to translate it.

Discussed Oct 16, 2019 (See Github)

Hadley: We only have 1 minute left, I would like to spend more time than that on this issue.

Discussed Oct 29, 2019 (See Github)

Hadley: discussed recently?

(looks like we discussed it 4 weeks ago)

Hadley: (reads chrishtr's last comment) Still seems like it's changing relationships: putting the referring site in the position of being the user agent. I feel like we keep going in circles on this.

Hadley: Also 4 comments up... misses that what we meant by "one browser" is that the only browser team involved is Google.

Hadley: Fundamentally I think this is from the ethical principles -- just not the way we think things work.

Alice: curious what you thought of chrishtr's answers to the second quoted question in this comment

Hadley: As someone who cares about Privacy, it frightens me that we would transfer responsibility for knowing my preferences to the website. He says relying on the UA only is too limiting to know what preferences are, works against openness, decentralization, and choice. It doesn't in the way I think about those terms. I feel like his last sentence about "smart sites" is useful but very different from offering suggestions to the next site.

Hadley: I'm stuck -- we already closed this "unsatisfied"; that's what I keep coming back to. I think we're fundamentally on different pages here about how the web works.

Peter: I'm also consider the explanier suggests it would influence the Accept-Language HTTP header... which it really shouldn't be doing.

Hadley: Have we written that down?

Peter: Not sure.

Hadley: We had in our feedback a year ago, from Paris... we talked about Accept-Language from the UA as an alternative. So I don't think it was in the explainer then.

Peter: I agree it's the wrong design; I don't see anything convincing me otherwise.

Alice: how convinced are you about the use cases they've explained?

Hadley: not convinced because the way I'm reading the use cases, they're use cases about developer of one website wanting more control about what happens on another website.

Peter: Only thoughts are either (a) close again or (b) invite Dave to a call to try to explain to us what we're missing?

Hadley: I'm up for that.

Alice: I could...

Hadley: Did we have a call with him before?

David: I think so.

Peter: I'm happy to if we think it would make progress. Make the offer... leave it up to him?

Hadley: I don't remember chrishtr being in the discussion then -- maybe a conversation with both would be useful?

Alice: I'm happy to ask if they could join us.

Comment by @hadleybeeman Oct 29, 2019 (See Github)

We've discussed this on our telecon today. We'd like to invite you for a call, @dtapuska (and anyone else relevant), to talk through what's changed and what want to accomplish here. We'll be in touch offline.

Comment by @dtapuska Oct 31, 2019 (See Github)

@hadleybeeman Looking back on the feedback you've provided I'm wondering if you think our proposal is to share a user's preferences that are stored on some logged in account with the next site? ie. Is your assumption that the website by setting the attribute is leaking something about user preferences? This is not the scenario we are describing at all. The scenario we are talking about is an empheral user interfacing with a website in some language that can't be determined by the browser and the website is just surfacing the language that the user interacted in.

We've tried to articulate that it is impossible for the browser to solely determine the interaction language (as we've given examples of interaction with speach, and we could provide more). It is unclear to me how you'd iterate on your suggestions to make the browser "smarter" to know what accept-language to send on the subsequent request (or to translate the page into).

Comment by @torgo Nov 5, 2019 (See Github)

Scheduled for call next week and hopefully you can dial in for that @dtapuska. Alice is going to contact you...

Comment by @dbaron Nov 13, 2019 (See Github)

This was discussed in the call today, for which minutes should soon be available.

I wanted to follow up on one point I raised in the discussion today. @chrishtr made the good point that one of the motivating factors here is that (a) many users don't know how to configure browsers or even understand the difference between the browser and the website and (b) the sites they're using (say, Google or Facebook) might be able to figure out through the text that the user types or interacts with what language the user wants to speak. I found this to be the most convincing argument for work in the problem space of having the site suggest the user's desired language.

However, this particular proposal results in each site being allowed to fix things for outbound links from their site, but the user's browser still not knowing the user's preferred language, and thus not being able to suggest the correct translation for all sites. This can push the user towards a more walled-garden view of the web -- if Google is the only site that knows their preferred language, then they can only browse the web starting from Google. If Facebook is the only site that knows the user's preferred language, then they can only browse the web starting from Facebook. But if the underlying mechanism was instead a way for the site to suggest to the browser that it knows something about the user's preferred language, then the browser might be able to ask the user if they prefer another language (perhaps even by asking in both its currently configured language and the newly proposed language).

During the call I suggested the first idea that came to mind, that the site could just directly suggest to the browser what they think the user's preferred language is, and the browser could choose to act or not act on that information. Now I realize that the browser could, if it wanted, do something with the hrefTranslate information and make the same suggestion to the user based on that information. (Would it make sense for the explainer to suggest this possibility?)

There was one other issue @hadleybeeman asked me to raise at the beginning of our discussion and I said I'd rather move on to the other points -- but I never actually managed to come back to it. I think we were also somewhat concerned that many of the fallback options here require javascript. I think it's also worth thinking about the different sorts of fallback that might be desired. I feel like we discussed this issue before but I don't see it above, so I'll state it here (possibly again).

In particular, if the linking site's first preference is to link to https://example.com/ with hrefTranslate=hi... it seems different sites might have different preferences for what should happen as fallback if the browser doesn't support client-side translation. Option (1) might be that the linking site would prefer the user just go to https://example.com/ without any translation or other intervention. Option (2) would be that the linking site would prefer to link to https://translate.google.com/translate?sl=en&tl=hi&u=https%3A%2F%2Fexample.com%2F in order to use server-side translation. Currently the mechanism proposed here defaults to (1), even though I suspect more sites probably want (2), whereas doing fallback type (2) requires Javascript. Perhaps that's OK; it at least keeps the markup "cleaner" from a certain perspective.

At the very least, it seems like the explainer should mention the question of fallback for browsers that don't support the feature, point out that the attribute should only be implemented in implementations that support client-side translation so that feature detection works, and perhaps give an example of feature detection.

I'd also note the explainer currently says:

Automatically fallback to server-side translation if client side isn’t supported

... which isn't really true, since that's not the fallback that happens automatically.

Discussed Nov 26, 2019 (See Github)

Hadley: complications - we discussed tradeoffs and knock-on effects. We agreed they would expand on these issues in the explainer which has not really been done. This is largely where it was. they did add a ToC which we asked for. doesn't encapuslate the richness of our discussion. We had concerns - not reflected. David commented - expanding on his concern that the asusmption is that one site knows more about the language preference than the browser, this promotes a walled garden view of the web. I understood they had made achange to make it more browser-centric.

David: it was always a hint to the browser to translate that site into the language specificed.

Hadley: so the browser must do something sensible with that?

David: if the search engine sends to the browser that the user wants this translated into X then maybe the browser should ask "is this your prefered language"?

Hadley: what would you like them to do with it? ... permisison prompt when href tranfstalte is encountered from non-trusted sites. how much belongs in the spec vs implementation specific.

David: i think it's worth mentioning that idea informatively.

Hadley: I don't see most of what we wanted in the explainer. we talked about the permission model and the danegrs of giving blanked protections. They have added one line to the answer to the privacy & security questionnaire. I feel the explainer doesn't capture the thought process that went into shaping htis proposal - so UAs can't make sensible decisions about what the dangers are. So i feel we have put a lot into this - open for a year - 2 calls on it - and a long issue thread - don't feel we can go further.

Dan: We should close ?

David: we should say something like "we were expecting a bit more change to the explainer" - one last chance. I could craft that.

Comment by @dtapuska Nov 26, 2019 (See Github)

I've updated the explainer. Changes are:

table of contents added
news aggregation use case (facebook) added
feature detection called out and example given
privacy section called out that UAs should prompt and reference provided to Chrome's privacy whitepaper.

If you wish further changes please let me know. I hope to send an intent to ship for chrome shortly.

Comment by @dbaron Nov 26, 2019 (See Github)

We discussed this in our teleconference today. Based on the discussion we had two weeks ago, we were expecting to see somewhat more change to the explainer. I think a number of TAG members were inclined to close this again as "unsatisfied", but I wanted to say explicitly that we were expecting a bit more change to the explainer so that you had another chance to revise it in response to that feedback.

Currently, the explainer seem to be internally inconsistent on a number of points:

whether hrefTranslate is a hint to the user-agent or a command that the site can expect to be reliably followed
whether the translation done by a browser implementing hrefTranslate must be client-side, or might instead be server side. (I think this is mainly that the sentence "Note that this translation service could be server-side or client-side, depending on which is available." disagrees with the opposite point made in a number of other places in the explainer.)

We were also hoping to see a little bit more explanation in the "Privacy Considerations" section of the explainer, which is currently three sentences. It seems worth explaining what some of the issues that people might be worried about are and where this proposal does or doesn't have issues that are worth thinking about. For example, translation generally has the issue that the user has to trust the translation service with their browsing history. But beyond that, it seems worth comparing the privacy characteristics of browser-mediated ("client-side") translation (where there's less direct transfer of information from one website to another, but where the website the user is visiting can probably figure out by looking at the DOM or layout results that this user in particular has translated the page to a particular language) with the privacy characteristics of server-mediated translation (where the origin model is broken and the referring site directly invokes the translation service by its URL, but the translated page probably then knows less about the user). I think there was a good bit more discussed when we talked about this two weeks ago, and I don't remember all the details, but I do recall that our conclusion then was that the explainer could say a bit more about the privacy and security considerations to make it clear how the relevant issues had (or hadn't) been considered.

Along similar lines, it also feels worth expanding on the statement that there will be a permission prompt, with a clearer explanation of what it is that the browser needs to ask permission for, and why user consent is needed.

I think it also may be worth expanding on the ecosystem concerns that I raised in the first part of https://github.com/w3ctag/design-reviews/issues/301#issuecomment-553180605 .

Comment by @dtapuska Nov 27, 2019 (See Github)

It definitely is only a hint. I thought that was clear in the document where it is presented on a number of occasions. I've adjusted the text where it said "reliably invoke the client side translation" to "reliably invoke the request for client-side". That is the only spot that I read there might be some misunderstanding. Please let me know if there are others from your point of view.
The explainer made reference to server-based and client-based translation and then a subsection of client-based which is server or client based. I've changed the wording to client-based online and offline and provided a section around defintions to help clarify this.
Sorry I was travelling for two weeks and some things got lost. I know we wanted to expand this and I went back to the notes and raw meeting notes from the call, the only reference I found was that me confirming that I'd do that. So when I went to look at things I saw that mostly the things I recall we talked about were relevant to a implementor of a client-based translation service. I've hopefully detailed it a little better for you. Feedback is welcome.

Comment by @dbaron Dec 4, 2019 (See Github)

@torgo @hadleybeeman and I are looking at this at a breakout at our Cupertino meeting.

OK, thanks for those updates; the explainer does now appear to be clearer.

It seems like there are still some things we fundamentally disagree about here, but there are also a number of ways this proposal and its explainer have become clearer during this review process, which we appreciate.

It doesn't appear that there's a lot more we can do right now, and it sounds like you're interested in going ahead and experimenting with this in origin trials or something similar, so we think it's best to close this issue for now. However, if you have feedback from origin trials that we would be interested in, feel free to ask us to reopen this issue, or file a new issue.

Comment by @dtapuska Dec 5, 2019 (See Github)

For the record we've already conducted origin trials for this. We are interested proceeding to ship it.