Python CGI: Difference between revisions

From XPUB & Lens-Based wiki
Line 58: Line 58:


=== Unicode issue with Apache ===
=== Unicode issue with Apache ===
An unfortunate side-effect of python3 getting "smarter" about text encoding is that it depends on the enclosing environment to "do the right thing", that is, pick the right text encoding to use when you print stuff. In Apache unfortunately the default cgi environment has no encoding set, so python defaults back to ASCII (and often makes your cgi silently fail in the browser).
An unfortunate side-effect of python3 getting "smarter" about text encoding is that it depends on the enclosing environment to "do the right thing", that is, pick the right text encoding to use when you print stuff. In Apache unfortunately the default cgi environment has no encoding set, so python defaults back to ASCII (and often makes your cgi silently fail in the browser). The solution is to make sure Apache passes on the "LANG" environment variable from the system to the script. This is achieved by two small steps:


* Add PassEnv LANG to your site settings (sites-available/000-default.conf inside the <VirtualHost> or to the end of your /etc/apache2/apache2.conf or .htaccess).
* Add PassEnv LANG to your site settings (sites-available/000-default.conf inside the <VirtualHost> or to the end of your /etc/apache2/apache2.conf or .htaccess).
* Uncomment . /etc/default/locale line in /etc/apache2/envvars.
* Uncomment . /etc/default/locale line in /etc/apache2/envvars.
* Make sure line similar to LANG="en_US.UTF-8" is present in /etc/default/locale.
 
Finally: make sure line similar to LANG="en_US.UTF-8" is present in /etc/default/locale (if it's not, you may need to reconfigure your shell environment to use utf-8).


https://stackoverflow.com/questions/9322410/set-encoding-in-python-3-cgi-scripts#19574801
https://stackoverflow.com/questions/9322410/set-encoding-in-python-3-cgi-scripts#19574801

Revision as of 06:50, 17 April 2018

Sample code

Hello World

#!/usr/bin/python
print ("Content-type: text/html;charset=utf-8\n")
print ("Hello world!")

Dump the env!

#!/usr/bin/env python
#-*- coding:utf-8 -*-

import cgi
cgi.print_environ()

Receive some text from a form

#!/usr/bin/env python
#-*- coding:utf-8 -*-

import cgitb; cgitb.enable()
import cgi

q = cgi.FieldStorage()
text = q.getvalue("text", "")
print ("Content-type: text/html;charset=utf-8\n")

# Print any received text in a purple box
print ("""<h1>Hello</h1>""")
if text:
    print ("""<div style="border: 5px solid purple">""")
    print text
    print ("""</div>""")

# Print the form
print ("""<form method="get" action="">
<textarea name="text"></textarea>
<input type="submit" />
</form>""")

RTFM

Python has a CGI module that takes care of many things like working with values posted form a form or handling an uploaded file.

http://docs.python.org/2/library/cgi.html#module-cgi

Python CGI Checklist

Python 3

Mini server

python -m http.server --cgi 8000

Unicode issue with Apache

An unfortunate side-effect of python3 getting "smarter" about text encoding is that it depends on the enclosing environment to "do the right thing", that is, pick the right text encoding to use when you print stuff. In Apache unfortunately the default cgi environment has no encoding set, so python defaults back to ASCII (and often makes your cgi silently fail in the browser). The solution is to make sure Apache passes on the "LANG" environment variable from the system to the script. This is achieved by two small steps:

  • Add PassEnv LANG to your site settings (sites-available/000-default.conf inside the <VirtualHost> or to the end of your /etc/apache2/apache2.conf or .htaccess).
  • Uncomment . /etc/default/locale line in /etc/apache2/envvars.

Finally: make sure line similar to LANG="en_US.UTF-8" is present in /etc/default/locale (if it's not, you may need to reconfigure your shell environment to use utf-8).

https://stackoverflow.com/questions/9322410/set-encoding-in-python-3-cgi-scripts#19574801

A simple test CGI...

#!/usr/bin/env python3
import sys
import cgitb; cgitb.enable()

print('Content-Type: text/html; charset=utf-8')
print()
print('<html><body><pre>' + sys.stdout.encoding + '</pre>h€lló wörld<body></html>')