asciifying http header for csv download; fixes #3952 #3975

rumbin · 2017-12-01T20:11:58Z

I think this should be all that is needed to fix #3952.

This way, the rather pathologic tab label →ʬéıρδ Ñämë← will get converted to WWeird_Name, which is pretty much what I would expect it to be in non-unicode representation.

xrmx

Also a test would be nice :)

xrmx · 2017-12-02T09:39:27Z

superset/views/core.py

@@ -2288,7 +2289,7 @@ def csv(self, client_id):
            csv = df.to_csv(index=False, **config.get('CSV_EXPORT'))
        response = Response(csv, mimetype='text/csv')
        response.headers['Content-Disposition'] = (
-            'attachment; filename={}.csv'.format(query.name))
+            'attachment; filename={}.csv'.format(unidecode(query.name)))


Maybe this is enough?

query.name.encode('ascii', 'ignore')

For a German there is a big difference, whether the Umlauts are entirely dropped or just replaced by their closest ascii counterpart.

I do understand that it may seem inappropriate to introduce yet another dependency for such a minor change, but I feel like this is affecting the user experience, especially of the less technically versed users.

You may safely skip the following pleadings…

There are languages, where this is even more relevant. To name an extreme example, in Turkish there exists this nice word düsündürttürücülügümüzün. Personally I think that dusundurtturuculugumuzun may still be readable, but dsndrttrclgmzn is not.

Also think of the Finnish people: A SQLLab tab on the unemployment in the town of Järvenpää työttömyys_Järvenpää could either be tyttmyys_Jrvenp or tyottomyys_Jarvenpaa.

And finally, also our Frensh speaking friends may benefit from not dropping the letters: élevé becomes lev or eleve.

rumbin · 2017-12-02T19:17:32Z

@xrmx: Judging from your smiley, I assume that you expect me to still not having the slightest idea of how to write a test for the csv function that this PR is all about. And you are right.

If somebody (you?) could give me a hand, especially for getting #3705 through, I would be really glad...

xrmx · 2017-12-02T22:47:45Z

The smile was me begging for tests. Anyway look at tests.core_tests.CoreTests.test_csv_endpoint copy tht test to something like test_csv_endpoint_works_returns_ascii_headers and test that you can decode('ascii') the header.

mistercrunch · 2017-12-05T20:04:47Z

Unidecode will be useful in other contexts, I'm ok with the new dep.

* asciifying http header for csv download; fixes apache#3952 * fixed order of imports and added unidecode to requirements in setup.py

rumbin added 2 commits December 1, 2017 21:00

asciifying http header for csv download; fixes apache#3952

22b6f6a

fixed order of imports and added unidecode to requirements in setup.py

b4ff8e8

xrmx reviewed Dec 2, 2017

View reviewed changes

mistercrunch merged commit e98a1c3 into apache:master Dec 5, 2017

mistercrunch added 🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels 🚢 0.21.0 labels Feb 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

asciifying http header for csv download; fixes #3952 #3975

asciifying http header for csv download; fixes #3952 #3975

rumbin commented Dec 1, 2017

xrmx left a comment

xrmx Dec 2, 2017

rumbin Dec 2, 2017

rumbin commented Dec 2, 2017

xrmx commented Dec 2, 2017

mistercrunch commented Dec 5, 2017

asciifying http header for csv download; fixes #3952 #3975

asciifying http header for csv download; fixes #3952 #3975

Conversation

rumbin commented Dec 1, 2017

xrmx left a comment

Choose a reason for hiding this comment

xrmx Dec 2, 2017

Choose a reason for hiding this comment

rumbin Dec 2, 2017

Choose a reason for hiding this comment

rumbin commented Dec 2, 2017

xrmx commented Dec 2, 2017

mistercrunch commented Dec 5, 2017