Show More
@@ -1,127 +1,125 b'' | |||
|
1 | 1 | .. _performance: |
|
2 | 2 | |
|
3 | 3 | ================================ |
|
4 | 4 | Optimizing Kallithea performance |
|
5 | 5 | ================================ |
|
6 | 6 | |
|
7 | 7 | When serving a large amount of big repositories, Kallithea can start performing |
|
8 | 8 | slower than expected. Because of the demanding nature of handling large amounts |
|
9 | 9 | of data from version control systems, here are some tips on how to get the best |
|
10 | 10 | performance. |
|
11 | 11 | |
|
12 | 12 | |
|
13 | 13 | Fast storage |
|
14 | 14 | ------------ |
|
15 | 15 | |
|
16 | 16 | Kallithea is often I/O bound, and hence a fast disk (SSD/SAN) and plenty of RAM |
|
17 | 17 | is usually more important than a fast CPU. |
|
18 | 18 | |
|
19 | 19 | |
|
20 | 20 | Caching |
|
21 | 21 | ------- |
|
22 | 22 | |
|
23 | 23 | Tweak beaker cache settings in the ini file. The actual effect of that is |
|
24 | 24 | questionable. |
|
25 | 25 | |
|
26 | 26 | .. note:: |
|
27 | 27 | |
|
28 | 28 | Beaker has no upper bound on cache size and will never drop any caches. For |
|
29 | 29 | memory cache, the only option is to regularly restart the worker process. |
|
30 | 30 | For file cache, it must be cleaned manually, as described in the `Beaker |
|
31 | 31 | documentation <https://beaker.readthedocs.io/en/latest/sessions.html#removing-expired-old-sessions>`_:: |
|
32 | 32 | |
|
33 | 33 | find data/cache -type f -mtime +30 -print -exec rm {} \; |
|
34 | 34 | |
|
35 | 35 | |
|
36 | 36 | Database |
|
37 | 37 | -------- |
|
38 | 38 | |
|
39 | 39 | SQLite is a good option when having a small load on the system. But due to |
|
40 | 40 | locking issues with SQLite, it is not recommended to use it for larger |
|
41 | 41 | deployments. |
|
42 | 42 | |
|
43 | 43 | Switching to PostgreSQL or MariaDB/MySQL will result in an immediate performance |
|
44 | 44 | increase. A tool like SQLAlchemyGrate_ can be used for migrating to another |
|
45 | 45 | database platform. |
|
46 | 46 | |
|
47 | 47 | |
|
48 | 48 | Horizontal scaling |
|
49 | 49 | ------------------ |
|
50 | 50 | |
|
51 |
Scaling horizontally means running several Kallithea instances |
|
|
52 | share the load. That can give huge performance benefits when dealing with large | |
|
53 | amounts of traffic (many users, CI servers, etc.). Kallithea can be scaled | |
|
54 | horizontally on one (recommended) or multiple machines. | |
|
51 | Scaling horizontally means running several Kallithea instances (also known as | |
|
52 | worker processes) and let them share the load. That is essential to serve other | |
|
53 | users while processing a long-running request from a user. Usually, the | |
|
54 | bottleneck on a Kallithea server is not CPU but I/O speed - especially network | |
|
55 | speed. It is thus a good idea to run multiple worker processes on one server. | |
|
55 | 56 | |
|
56 | It is generally possible to run WSGI applications multithreaded, so that | |
|
57 | several HTTP requests are served from the same Python process at once. That can | |
|
58 | in principle give better utilization of internal caches and less process | |
|
59 | overhead. | |
|
57 | .. note:: | |
|
60 | 58 | |
|
61 | One danger of running multithreaded is that program execution becomes much more | |
|
62 | complex; programs must be written to consider all combinations of events and | |
|
63 | problems might depend on timing and be impossible to reproduce. | |
|
59 | Kallithea and the embedded Mercurial backend are not thread-safe. Each | |
|
60 | worker process must thus be single-threaded. | |
|
64 | 61 | |
|
65 | Kallithea can't promise to be thread-safe, just like the embedded Mercurial | |
|
66 | backend doesn't make any strong promises when used as Kallithea uses it. | |
|
67 | Instead, we recommend scaling by using multiple server processes. | |
|
62 | Web servers can usually launch multiple worker processes - for example ``mod_wsgi`` with the | |
|
63 | ``WSGIDaemonProcess`` ``processes`` parameter or ``uWSGI`` or ``gunicorn`` with | |
|
64 | their ``workers`` setting. | |
|
68 | 65 | |
|
69 | Web servers with multiple worker processes (such as ``mod_wsgi`` with the | |
|
70 | ``WSGIDaemonProcess`` ``processes`` parameter) will work out of the box. | |
|
71 | ||
|
66 | Kallithea can also be scaled horizontally across multiple machines. | |
|
72 | 67 | In order to scale horizontally on multiple machines, you need to do the |
|
73 | 68 | following: |
|
74 | 69 | |
|
75 | 70 | - Each instance's ``data`` storage needs to be configured to be stored on a |
|
76 | 71 | shared disk storage, preferably together with repositories. This ``data`` |
|
77 | 72 | dir contains template caches, sessions, whoosh index and is used for |
|
78 | 73 | task locking (so it is safe across multiple instances). Set the |
|
79 | 74 | ``cache_dir``, ``index_dir``, ``beaker.cache.data_dir``, ``beaker.cache.lock_dir`` |
|
80 | 75 | variables in each .ini file to a shared location across Kallithea instances |
|
81 | 76 | - If using several Celery instances, |
|
82 | 77 | the message broker should be common to all of them (e.g., one |
|
83 | 78 | shared RabbitMQ server) |
|
84 | 79 | - Load balance using round robin or IP hash, recommended is writing LB rules |
|
85 | 80 | that will separate regular user traffic from automated processes like CI |
|
86 | 81 | servers or build bots. |
|
87 | 82 | |
|
88 | 83 | |
|
89 | 84 | Serve static files directly from the web server |
|
90 | 85 | ----------------------------------------------- |
|
91 | 86 | |
|
92 | 87 | With the default ``static_files`` ini setting, the Kallithea WSGI application |
|
93 | 88 | will take care of serving the static files from ``kallithea/public/`` at the |
|
94 | 89 | root of the application URL. |
|
95 | 90 | |
|
96 | 91 | The actual serving of the static files is very fast and unlikely to be a |
|
97 | 92 | problem in a Kallithea setup - the responses generated by Kallithea from |
|
98 | 93 | database and repository content will take significantly more time and |
|
99 | 94 | resources. |
|
100 | 95 | |
|
101 | 96 | To serve static files from the web server, use something like this Apache config |
|
102 | 97 | snippet:: |
|
103 | 98 | |
|
104 | 99 | Alias /images/ /srv/kallithea/kallithea/kallithea/public/images/ |
|
105 | 100 | Alias /css/ /srv/kallithea/kallithea/kallithea/public/css/ |
|
106 | 101 | Alias /js/ /srv/kallithea/kallithea/kallithea/public/js/ |
|
107 | 102 | Alias /codemirror/ /srv/kallithea/kallithea/kallithea/public/codemirror/ |
|
108 | 103 | Alias /fontello/ /srv/kallithea/kallithea/kallithea/public/fontello/ |
|
109 | 104 | |
|
110 | 105 | Then disable serving of static files in the ``.ini`` ``app:main`` section:: |
|
111 | 106 | |
|
112 | 107 | static_files = false |
|
113 | 108 | |
|
114 | 109 | If using Kallithea installed as a package, you should be able to find the files |
|
115 | 110 | under ``site-packages/kallithea``, either in your Python installation or in your |
|
116 | 111 | virtualenv. When upgrading, make sure to update the web server configuration |
|
117 | 112 | too if necessary. |
|
118 | 113 | |
|
119 | 114 | It might also be possible to improve performance by configuring the web server |
|
120 | 115 | to compress responses (served from static files or generated by Kallithea) when |
|
121 | 116 | serving them. That might also imply buffering of responses - that is more |
|
122 | 117 | likely to be a problem; large responses (clones or pulls) will have to be fully |
|
123 | 118 | processed and spooled to disk or memory before the client will see any |
|
124 | 119 | response. See the documentation for your web server. |
|
125 | 120 | |
|
126 | 121 | |
|
127 | 122 | .. _SQLAlchemyGrate: https://github.com/shazow/sqlalchemygrate |
|
123 | .. _mod_wsgi: https://modwsgi.readthedocs.io/ | |
|
124 | .. _uWSGI: https://uwsgi-docs.readthedocs.io/ | |
|
125 | .. _gunicorn: http://pypi.python.org/pypi/gunicorn |
General Comments 0
You need to be logged in to leave comments.
Login now