Ollama LLM Provider

Created on 10 June 2024, 18 days ago
Updated 24 June 2024, 4 days ago

Problem/Motivation

Add an Ollama LLM abstraction layer to the core AI module, as Ollama is one of the important LLMs to handle out of the box.

Remaining tasks

Discuss

  • 🔲 Should the Ollama provider be in core? If yes, communicate this with @orkut-murat-yılmaz and maybe also get him on board
  • 🔲 Should AI Interpolator rules and Search API AI Plugins be part of core?
  • More? (some issues have been raised, please feel free to add them here)

Tasks

📌 Task
Status: Fixed
Version: 1.0
Component: Code
Created by: 🇩🇪Germany Marcus_Johansson


Merge Requests

Comments & Activities

  • Issue created by @Marcus_Johansson
  • 🇩🇪Germany Marcus_Johansson

    If we decide that this should be in core, we need to communicate this with @orkut-murat-yılmaz and maybe also get him on board.

    We also need to decide if the AI Interpolator rules and Search API AI Plugins should be part of core as well. I would say it makes sense to have something that works out of the box.

  • 🇱🇹Lithuania mindaugasd

    Commented on this here: 📌 [meta] Discussion: what LLM providers to include (Active), and here: ✨ Create AI ecosystem "add-ons" page (Active).

  • 🇬🇧United Kingdom yautja_cetanu

    Features Ollama might need:

    • Some ability to do content moderation or use another LLM: https://www.drupal.org/project/ai/issues/3454452 ✨ [META] Create an AI Security module for custom moderation calls (Active)
    • Ability to find models that work with Ollama, either in some kind of browser or by copying and pasting IDs from an external site
    • Ability to just download and install a model directly on the server
    • Ability to connect to a model hosted on another server using Ollama (see the sketch after this list)
    • Perhaps the ability for the Drupal site to have remote control over that other Ollama server, so it can download and set up models on it directly
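
    Connecting to an Ollama server on another host already works from the command line and HTTP API - a rough sketch with placeholder hostnames:

    $ OLLAMA_HOST=gpu-box.example.com:11434 ollama list   # point the ollama CLI at a remote server
    $ curl http://gpu-box.example.com:11434/api/tags      # or query its HTTP API directly
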
  • 🇱🇹Lithuania mindaugasd

    A few more tasks:

    • Prepare information on what kind of hardware is needed to run Ollama, and how much it costs
    • Document how to, or code a feature to, shut down the GPU instance when Ollama is not in use

    By completing these tasks, we could figure out how many people can afford this in practice, and how cost-effective it can be to run.

    One real use case of this module: installing it locally for people who have a decent GPU on their local computer.

    Because of these constraints, it should probably be outside of the AI module (not included), unless we find out that it can be practical for most people.

  • 🇬🇧United Kingdom yautja_cetanu

    That is worth doing but a couple of things:

    • Ollama can run a lot of open-source models. Some are very tiny and can be run on any server without a GPU.
    • This provider will be a better use case for what you brought up: for people installing it locally with a decent GPU, LM Studio's UI is just so much easier than Ollama's. https://www.drupal.org/project/ai/issues/3453592 📌 LM Studio LLM Provider (Active)
    • I think Ollama will eventually be for organisations that really want this for privacy reasons. That said, documenting pricing, especially on your AI Initiative page, will be very helpful. Knowing what hardware people have now is probably not helpful, as many clients are talking about wanting self-hosted AI anyway because the privacy is worth whatever it costs.
    • I think an Ollama implementation is much more likely to be included in Starshot. Gabor was inspired by: https://hacks.mozilla.org/2024/05/experimenting-with-local-alt-text-gene...
  • 🇱🇹Lithuania mindaugasd

    Some are very tiny

    What can one do with a tiny model? Maybe some specialized automation.

    worth whatever cost

    Does it need to be included in Drupal CMS for everybody, then? For clients who have enough resources for this, agencies/developers can set it up for them.

    Gabor was inspired

    In general, people show demand to experiment with local AI. So if there is demand for whatever reason, it could be included.

    included in Starshot

    Another question is how to make it easy enough and accessible to regular Drupal CMS users. How does one actually install it on the server? How much knowledge, investment and experience does it require?

  • Status changed to Needs review 7 days ago
  • 🇩🇪Germany Marcus_Johansson

    So an initial version of this provider is done and can be tested on DEV.

    The first version has no control of Ollama itself, like pulling/deleting models; that has to be done via the command line at the moment. But once that is done, it's usable.

    Someone should test it with the explorers.

    It supports chat and embed for now; text completion will follow when I get the time to add it.
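
    For reference, pulling and deleting models on the command line looks roughly like this (llama3 is just an example model name):

    $ ollama pull llama3   # download a model from the Ollama library
    $ ollama rm llama3     # delete it again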

  • 🇩🇪Germany Marcus_Johansson

    Regarding "document how to, or code a feature to, shut down the GPU instance when Ollama is not in use":

    This should be done in a hosting solution like a Runpod module or something similar. I have been thinking about building such a solution. In any case, it should not be in the AI module; external modules can listen to the AI module's events.

  • Status changed to Needs work 6 days ago
  • 🇩🇰Denmark ressa Copenhagen

    Thanks @Marcus_Johansson, I tried the module, and almost got it working ...

    System

    • Debian 12
    • DDEV 1.23.2

    Modules

    I enabled these modules:

    • AI Core
    • Key
    • AI API Explorer
    • Ollama Provider

    $ drush in ai key ai_api_explorer provider_ollama

    I see that the Key module is a requirement of the ai module, but Ollama has no need for a key ... maybe the requirement should be set under the individual providers instead?

    $ grep -iR -A 2 "dependencies:" .
    ./ai.info.yml:dependencies:
    ./ai.info.yml-  - key:key
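
    If that requirement moved down to the individual providers, a provider that actually needs API keys could declare it in its own info file - a hypothetical sketch, not the module's current layout:

    # hypothetical some_provider.info.yml for a provider that really uses keys
    name: 'Some API-key-based AI provider'
    type: module
    dependencies:
      - ai:ai
      - key:key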
    

    Setup Ollama Authentication

    I entered:

    • Host Name: http://127.0.0.1
    • Port: 11434

    Ollama

    Ollama is available:

    $ curl 127.0.0.1:11434
    Ollama is running
    

    Two Ollama models are installed:

    $ ollama list
    NAME                 	ID          	SIZE  	MODIFIED    
    dolphin-llama3:latest	613f068e29f8	4.7 GB	2 weeks ago	
    llama3:latest        	365c0bd3c000	4.7 GB	3 weeks ago	
    

    Two Ollama models are available via command line:

    $ curl 127.0.0.1:11434/api/tags
    {"models":[{"name":"dolphin-llama3:latest","model":"dolphin-llama3:latest","modified_at":"2024-06-06T22:40:03.181858319+02:00","size":4661235994,"digest":"613f068e29f863bb900e568f920401b42678efca873d7a7c87b0d6ef4945fadd","details":{"parent_model":"","format":"gguf","family":"llama","families":["llama"],"parameter_size":"8B","quantization_level":"Q4_0"}},{"name":"llama3:latest","model":"llama3:latest","modified_at":"2024-05-27T12:53:40.033272983+02:00","size":4661224676,"digest":"365c0bd3c000a25d28ddbf732fe1c6add414de7275464c4e4d1c3b5fcb5d8ad1","details":{"parent_model":"","format":"gguf","family":"llama","families":["llama"],"parameter_size":"8.0B","quantization_level":"Q4_0"}}]}

    AI Chat Explorer

    When I select Ollama from the dropdown, "Provider Configuration" pops up but is empty, and I get an error: "Error message -- Oops, something went wrong. Check your browser's developer console for more details." I guess I should have had the two models presented there instead?

    From the console:

    XHRPOST
    https://drupal10.ddev.site/admin/config/ai/development/chat-generation?ajax_form=1&_wrapper_format=drupal_ajax
    [HTTP/2 500 Internal Server Error 55ms]
      
    POST
      https://drupal10.ddev.site/admin/config/ai/development/chat-generation?ajax_form=1&_wrapper_format=drupal_ajax
    Status
    500
    Internal Server Error
    VersionHTTP/2
    Transferred4.69 kB (4.32 kB size)
    Referrer Policystrict-origin-when-cross-origin
    Request PriorityHighest
    
    Uncaught 
    Object { message: "\nAn AJAX HTTP error occurred.\nHTTP Result Code: 500\nDebugging information follows.\nPath: /admin/config/ai/development/chat-generation?ajax_form=1\nStatusText: Internal Server Error\nResponseText: The website encountered an unexpected error. Try again later.GuzzleHttp\\Exception\\ConnectException: cURL error 7: Failed to connect to 127.0.0.1 port 11343 after 0 ms: Couldn't connect to server (see https://curl.haxx.se/libcurl/c/libcurl-errors.html) for http://127.0.0.1:11343/api/tags in GuzzleHttp\\Handler\\CurlFactory::createRejection() (line 210 of /var/www/html/vendor/guzzlehttp/guzzle/src/Handler/CurlFactory.php). GuzzleHttp\\Handler\\CurlFactory::finishError(Object, Object, Object) (Line: 110)\nGuzzleHttp\\Handler\\CurlFactory::finish(Object, Object, Object) (Line: 47)\nGuzzleHttp\\Handler\\CurlHandler->__invoke(Object, Array) (Line: 28)\nGuzzleHttp\\Handler\\Proxy::GuzzleHttp\\Handler\\{closure}(Object, Array) (Line: 48)\nGuzzleHttp\\Handler\\Proxy::GuzzleHttp\\Handler\\{closure}(Object, Array) (Line: 35)\nGuzzleHttp\\PrepareBodyMiddleware->__invoke(Object, Array) (Line: 31)\nGuzzleHttp\\Middleware::GuzzleHttp\\{closure}(Object, Array) (Line: 71)\nGuzzleHttp\\RedirectMiddleware->__invoke(Object, Array) (Line: 66)\nGuzzleHttp\\Middleware::GuzzleHttp\\{closure}(Object, Array) (Line: 75)\nGuzzleHttp\\HandlerStack->__invoke(Object, Array) (Line: 333)\nGuzzleHttp\\Client->transfer(Object, Array) (Line: 169)\nGuzzleHttp\\Client->requestAsync('GET', Object, Array) (Line: 189)\nGuzzleHttp\\Client->request('GET', 'http://127.0.0.1:11343/api/tags', Array) (Line: 106)\nDrupal\\provider_ollama\\OllamaControlApi->makeRequest('api/tags', Array, 'GET') (Line: 49)\nDrupal\\provider_ollama\\OllamaControlApi->getModels() (Line: 62)\nDrupal\\provider_ollama\\Plugin\\AiProvider\\OllamaProvider->getConfiguredModels('chat')\nReflectionMethod->invokeArgs(Object, Array) (Line: 106)\nDrupal\\ai\\Plugin\\ProviderProxy->wrapperCall(Object, Array) (Line: 78)\nDrupal\\ai\\Plugin\\ProviderProxy->__call('getConfiguredModels', Array) (Line: 127)\nDrupal\\ai\\Service\\AiProviderFormHelper->generateAiProvidersForm(Array, Object, 'chat', 'chat_', 2, 1003) (Line: 159)\nDrupal\\ai_api_explorer\\Form\\ChatGenerationForm->buildForm(Array, Object)\ncall_user_func_array(Array, Array) (Line: 536)\nDrupal\\Core\\Form\\FormBuilder->retrieveForm('ai_api_chat_generation', Object) (Line: 375)\nDrupal\\Core\\Form\\FormBuilder->rebuildForm('ai_api_chat_generation', Object, Array) (Line: 633)\nDrupal\\Core\\Form\\FormBuilder->processForm('ai_api_chat_generation', Array, Object) (Line: 326)\nDrupal\\Core\\Form\\FormBuilder->buildForm(Object, Object) (Line: 73)\nDrupal\\Core\\Controller\\FormController->getContentResult(Object, Object)\ncall_user_func_array(Array, Array) (Line: 123)\nDrupal\\Core\\EventSubscriber\\EarlyRenderingControllerWrapperSubscriber->Drupal\\Core\\EventSubscriber\\{closure}() (Line: 638)\nDrupal\\Core\\Render\\Renderer->executeInRenderContext(Object, Object) (Line: 121)\nDrupal\\Core\\EventSubscriber\\EarlyRenderingControllerWrapperSubscriber->wrapControllerExecutionInRenderContext(Array, Array) (Line: 97)\nDrupal\\Core\\EventSubscriber\\EarlyRenderingControllerWrapperSubscriber->Drupal\\Core\\EventSubscriber\\{closure}() (Line: 181)\nSymfony\\Component\\HttpKernel\\HttpKernel->handleRaw(Object, 1) (Line: 76)\nSymfony\\Component\\HttpKernel\\HttpKernel->handle(Object, 1, 1) (Line: 53)\nDrupal\\Core\\StackMiddleware\\Session->handle(Object, 1, 1) (Line: 
48)\nDrupal\\Core\\StackMiddleware\\KernelPreHandle->handle(Object, 1, 1) (Line: 28)\nDrupal\\Core\\StackMiddleware\\ContentLength->handle(Object, 1, 1) (Line: 32)\nDrupal\\big_pipe\\StackMiddleware\\ContentLength->handle(Object, 1, 1) (Line: 106)\nDrupal\\page_cache\\StackMiddleware\\PageCache->pass(Object, 1, 1) (Line: 85)\nDrupal\\page_cache\\StackMiddleware\\PageCache->handle(Object, 1, 1) (Line: 48)\nDrupal\\Core\\StackMiddleware\\ReverseProxyMiddleware->handle(Object, 1, 1) (Line: 51)\nDrupal\\Core\\StackMiddleware\\NegotiationMiddleware->handle(Object, 1, 1) (Line: 36)\nDrupal\\Core\\StackMiddleware\\AjaxPageState->handle(Object, 1, 1) (Line: 51)\nDrupal\\Core\\StackMiddleware\\StackedHttpKernel->handle(Object, 1, 1) (Line: 741)\nDrupal\\Core\\DrupalKernel->handle(Object) (Line: 19)\n", name: "AjaxError", stack: "@https://drupal10.ddev.site/sites/default/files/js/js_jQzZ_qeL-aNiRqFwdB8MFdA5vskuvL7sZ7mgMrXNFuQ.js?scope=footer&delta=0&language=en&theme=claro&include=eJx9juEOgjAMhF9obs_gk5BSTpmOdXaF6NsLCiYY46-23-Wux5INdxsphU7HQsnzhxxSzNfqWBSbSGxxwkvYcZaUqFTsYDUyrP44h2qe4eU2Qh_-JDo4E0ktaVino26IuVmvpoKU-_Aejv8V_RmwxXpTwPcyQZcW2b7_elSmguNidm08NyUWhG15AmcicU0:180:2411\n@https://drupal10.ddev.site/sites/default/files/js/js_jQzZ_qeL-aNiRqFwdB8MFdA5vskuvL7sZ7mgMrXNFuQ.js?scope=footer&delta=0&language=en&theme=claro&include=eJx9juEOgjAMhF9obs_gk5BSTpmOdXaF6NsLCiYY46-23-Wux5INdxsphU7HQsnzhxxSzNfqWBSbSGxxwkvYcZaUqFTsYDUyrP44h2qe4eU2Qh_-JDo4E0ktaVino26IuVmvpoKU-_Aejv8V_RmwxXpTwPcyQZcW2b7_elSmguNidm08NyUWhG15AmcicU0:180:19740\n" }
    js_jQzZ_qeL-aNiRqFwdB8MFdA5vskuvL7sZ7mgMrXNFuQ.js:180:2411
    
  • 🇩🇰Denmark ressa Copenhagen

    The challenge is probably to get DDEV connected to an IP on the host machine, I think ...

  • 🇩🇪Germany Marcus_Johansson

    Yes, either you have to set up Ollama in a DDEV Docker container, or you can try this as the hostname:

    host.docker.internal

    It should work in most cases for connecting to the Docker host.

  • 🇩🇪Germany Marcus_Johansson

    Also make sure to start Ollama so that it listens to the world in that case, since it only listens on localhost by default and your DDEV machine is not localhost.

    https://github.com/ollama/ollama/issues/703
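
    On a Linux install where Ollama runs as a systemd service, one way to do that (a rough sketch along the lines of the issue linked above - adjust to your own setup) is a service override:

    $ sudo systemctl edit ollama.service
    # in the override file, add:
    #   [Service]
    #   Environment="OLLAMA_HOST=0.0.0.0"
    $ sudo systemctl daemon-reload
    $ sudo systemctl restart ollama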

  • 🇩🇰Denmark ressa Copenhagen

    Thanks for fast answers @Marcus_Johansson!

    Do you have slightly more concrete suggestions? (like "Insert this in the Ollama config file /home/user/.ollama/config.env, add IP: host.docker.internal in the DDEV config file, and restart DDEV")

    My thinking is that, as soon as I get it working with DDEV (which is the officially recommended developer tool), I'll document it in the README and the AI documentation →. But rather than spending a lot of time experimenting with different solutions from the page you link to, more concrete, actionable suggestions would save me a lot of time, which I can better spend on documenting how to use the Drupal AI module and Ollama in DDEV.

  • 🇩🇪Germany Marcus_Johansson

    Ah, so something like this:

    1. Start Ollama with an environment variable, something like OLLAMA_HOST="0.0.0.0" ollama serve
    2. When you define the hostname on the page /admin/config/ai/providers/ollama, it should be http://host.docker.internal. This is a special hostname that connects to the Docker parent host.

    That should work on Linux and 99% of the time on Mac. On Windows it would be two commands and some special Windows sauce to set up Ollama, so something like:

    set OLLAMA_HOST=0.0.0.0
    ollama serve
    approve the firewall

    If it doesn't work on Windows let me know and I'll test on my gaming computer.
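
    A quick way to check the connection from the project root could be something like this (assuming DDEV's usual host.docker.internal mapping; it should print the "Ollama is running" banner):

    $ ddev exec curl -s http://host.docker.internal:11434
    Ollama is running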

  • 🇩🇪Germany Marcus_Johansson

    I might do a video about it, since with Gemma it would actually be a showcase for something that you can host on a web server or your own non-GPU laptop with 16 GB of RAM.

  • 🇩🇰Denmark ressa Copenhagen

    Beautiful, thorough description, thanks! I'll try it later today and add it to the documentation after verifying. I am on Debian 12, but if you could verify that it works on Windows too, that would be very nice as well.

    A video would be fantastic, and perhaps you could include an uncensored LLM such as dolphin-llama3, to cover that aspect of AI as well? Or do that model instead of Gemma (Google), if the steps are identical? Or do both? :)

  • 🇩🇪Germany Marcus_Johansson

    @ressa - check it out here: https://youtu.be/LFFoGfYFMn4

  • 🇩🇪Germany Marcus_Johansson

    The steps should be identical - I used Gemma so it's a showcase that most modern laptops can run.

  • 🇩🇰Denmark ressa Copenhagen

    Thanks for the video, it's great!

    (It looks like the bottom of the screen is missing, where the commands are shown ... also, perhaps you could consider making the fonts slightly bigger in your videos? But let me emphasize that I very much appreciate your videos, they really are very good. The smallish font is only a minor beauty mark.)

    I tried to follow your tips, and am getting close, though I can't connect to the models ...

    Firewall

    In Debian 12, I opened port 11434 for Ollama in ufw, using Gufw:

    1. Open "Report"
    2. Select "ollama"
    3. Click "+" to create rule
    4. Select "Policy: Allow" and "Direction: Both"

    ... which created these rules:

    $ sudo iptables -S
    [...]
    -A ufw-user-input -p tcp -m tcp --dport 11434 -j ACCEPT
    -A ufw-user-output -p tcp -m tcp --dport 11434 -j ACCEPT

    Ollama

    Stop Ollama and serve it with 0.0.0.0 as the IP; check first with netstat (install net-tools):

    $ sudo netstat -tunlp | grep 11434
    tcp        0      0 127.0.0.1:11434         0.0.0.0:*               LISTEN      10728/ollama
    $ sudo systemctl stop ollama
    $ sudo netstat -tunlp | grep 11434
    

    The last command gives no result.

    Serve under 0.0.0.0, check with netstat, and check the Ollama IPs:

    $ OLLAMA_HOST=0.0.0.0 ollama serve
    $ sudo netstat -tunlp | grep 11434
    tcp6       0      0 :::11434                :::*                    LISTEN      11027/ollama
    $ curl http://127.0.0.1:11434
    Ollama is running
    $ curl http://0.0.0.0:11434
    Ollama is running
    

    DDEV and host.docker.internal

    Check inside DDEV:

    $ ddev ssh
    $ ping host.docker.internal
    PING host.docker.internal (172.17.0.1) 56(84) bytes of data.
    64 bytes from host.docker.internal (172.17.0.1): icmp_seq=1 ttl=64 time=0.123 ms
    [...]
    $ curl host.docker.internal:11434
    Ollama is running
    

    ... but no models are available:

    $ curl host.docker.internal:11434/api/tags
    {"models":[]}
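
    (One possible explanation - an assumption, not verified: the manually started ollama serve runs as the current user, while the models listed earlier were pulled through the systemd service, and the two instances may be looking at different model stores:)

    $ sudo ls /usr/share/ollama/.ollama/models   # models pulled via the systemd service (Linux installer default)
    $ ls ~/.ollama/models                        # models seen by a server started manually as the current user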
    

    Ollama Authentication

    Add these values:

    Host Name: http://host.docker.internal
    Port: 11434

    Chat explorer

    When I select Ollama, there are no models in the dropdown ...

    PS. I spent a looong time trying to make it work with port 11343, since that is the port number in the placeholder :-) I'll create an MR to fix this.

  • Status changed to Needs review 5 days ago
  • 🇩🇰Denmark ressa Copenhagen
  • Pipeline finished with Success
    5 days ago
    Total: 150s
    #206064
  • 🇩🇰Denmark ressa Copenhagen

    I found the missing piece of the puzzle. Maybe there's a better way? Anyhow, I needed to also start Ollama, which I assumed ollama serve would take care of, but it looks like it doesn't ... I may absolutely be mistaken and doing something wrong?

    I tried to write a list of the steps, and created AI > How to set up a provider → .

  • 🇩🇰Denmark ressa Copenhagen

    Update Issue Summary.

  • 🇩🇰Denmark ressa Copenhagen

    Add link.

  • Status changed to Fixed 4 days ago
  • 🇩🇪Germany Marcus_Johansson

    Merged, and thanks for your work!

    ollama serve should be the main command to make it start listening. You can run ollama without it, via ollama run, so it's a little bit strange.
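
    Roughly, the split between the two commands looks like this (a sketch; llama3 is just an example model name):

    $ ollama serve        # runs the HTTP API server that the Drupal provider talks to (port 11434 by default)
    $ ollama run llama3   # interactive chat client; it needs a server already running (ollama serve or the system service)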

    I added this follow-up ticket as good-to-have: https://www.youtube.com/watch?v=LFFoGfYFMn4&ab_channel=DrupalAIVideos

  • 🇩🇰Denmark ressa Copenhagen

    Thanks!

    It appears (I am not sure, since the commands in the video are below the screen, see the bottom of comment #22 📌 Ollama LLM Provider (Active)) that in your video you pull the LLM, which also starts it ...

    If you didn't do that step, it might not be running, and therefore perhaps not available to DDEV?

    Being able to pull and delete models for Ollama in the web UI would be nice, thanks!
