Skip to content

Instantly share code, notes, and snippets.

@573
Last active March 14, 2025 12:06
Show Gist options
  • Save 573/2dfc1cb2171dbe05e98f3d7ce0250b65 to your computer and use it in GitHub Desktop.
Save 573/2dfc1cb2171dbe05e98f3d7ce0250b65 to your computer and use it in GitHub Desktop.
fix for emacs wordlist utf-8 used for completion contains garbled isolatin when used with cape package (cape-dict)

Typing first three letters of a german word with umlaut contained in wordlist-german.txt into a utf-8 buffer pops up suggestions for completion but with garbled umlaut as in aalähnlich Dict. When I open the wordlist-german.txt in another buffer in parallel and start typing the word in scratch buffer (set to utf-8) the umlaut stays in utf-8 as in aalähnliches Dabbrev this time via dabbrev.

https://github.com/minad/cape/blob/cc9cf45c651ad4784444cfdbdf238ac18dce39c9/cape.el#L643C9-L643C23

:config (setq cape-dict-grep nil) (in use-package cape) as in attached .emacs file

$ file -i wordlist-german.txt wordlist-german.txt: text/plain; charset=utf-8

GNU Emacs 30.1 (build 2, x86_64-w64-mingw32) of 2025-02-23

packages (M-x list-packages) most recent versions

;; For Windows emacs this file belongs to $env:APPDATA
;; (GNU Emacs 29.4 (build 2, x86_64-w64-mingw32) of 2024-07-05)
;; If you don't have MELPA in your package archives:
(require 'package)
(add-to-list
'package-archives
'("melpa" . "http://melpa.org/packages/") t)
(add-to-list 'package-archives '("org" . "http://orgmode.org/elpa/") t) ; Org-mode's repository
(package-initialize)
(package-refresh-contents)
(custom-set-variables
;; custom-set-variables was added by Custom.
;; If you edit it by hand, you could mess it up, so be careful.
;; Your init file should contain only one such instance.
;; If there is more than one, they won't work right.
;; ;; hier landen eigene Wörter, die ispell noch nicht kannte
;; '(ispell-alternate-dictionary "C:\\Users\\dkahlenberg\\.aspell.LANG.pws")
;; https://github.com/minad/cape/issues/70
;; ('ispell-change-dictionary "C:/Users/dkahlenberg/wordlist-german.txt")
'(package-selected-packages
'(## bind-key cape corfu deft emacs-everywhere helm-org-rifle jinx
jira-markup-mode magit moe-theme org-dropbox org-plus-contrib
use-package weblogger websocket zetteldeft)))
(custom-set-faces
;; custom-set-faces was added by Custom.
;; If you edit it by hand, you could mess it up, so be careful.
;; Your init file should contain only one such instance.
;; If there is more than one, they won't work right.
)
;; ;;https://emacs.stackexchange.com/questions/27027/how-to-supply-ispell-program-with-dictionaries
(setenv "LC_ALL" "de_DE.UTF-8")
(setenv "DICPATH"
(concat (getenv "HOMEDRIVE") (getenv "HOMEPATH") ""))
(setenv "LANG" "de_DE")
;; from: https://github.com/jwiegley/use-package?tab=readme-ov-file#installing-use-package
;; This is only needed once, near the top of the file
(eval-when-compile
;; Following line is not needed if use-package.el is in ~/.emacs.d
;;(add-to-list 'load-path "<path where use-package is installed>")
(require 'use-package))
;; https://github.com/minad/jinx/issues/10
;;(use-package jinx
;; :bind (("M-?" . jinx-correct)
;; ("M-C-k" . jinx-languages))
;; :init
;; (add-hook 'emacs-startup-hook #'global-jinx-mode))
(use-package bind-key
:demand t)
(use-package corfu
;; Optional customizations
:custom
;; (corfu-cycle t) ;; Enable cycling for `corfu-next/previous'
(corfu-auto t) ;; Enable auto completion
;; (corfu-separator ?\s) ;; Orderless field separator
;; (corfu-quit-at-boundary nil) ;; Never quit at completion boundary
;; (corfu-quit-no-match nil) ;; Never quit, even if there is no match
;; (corfu-preview-current nil) ;; Disable current candidate preview
;; (corfu-preselect 'prompt) ;; Preselect the prompt
;; (corfu-on-exact-match nil) ;; Configure handling of exact matches
;; (corfu-scroll-margin 5) ;; Use scroll margin
;; Enable Corfu only for certain modes. See also `global-corfu-modes'.
;; :hook ((prog-mode . corfu-mode)
;; (shell-mode . corfu-mode)
;; (eshell-mode . corfu-mode))
;; Recommended: Enable Corfu globally. This is recommended since Dabbrev can
;; be used globally (M-/). See also the customization variable
;; `global-corfu-modes' to exclude certain modes.
:init
(global-corfu-mode))
(use-package emacs
:custom
;; TAB cycle if there are only few candidates
;; (completion-cycle-threshold 3)
;; Enable indentation+completion using the TAB key.
;; `completion-at-point' is often bound to M-TAB.
(tab-always-indent 'complete)
;; Emacs 30 and newer: Disable Ispell completion function. As an alternative,
;; try `cape-dict'.
(text-mode-ispell-word-completion nil)
;; Hide commands in M-x which do not apply to the current mode. Corfu
;; commands are hidden, since they are not used via M-x. This setting is
;; useful beyond Corfu.
(read-extended-command-predicate #'command-completion-default-include-p))
(use-package cape
;;:after corfu
;; Bind prefix keymap providing all Cape commands under a mnemonic key.
;; Press C-c p ? to for help.
:bind ("C-c p" . cape-prefix-map) ;; Alternative keys: M-p, M-+, ...
;; Alternatively bind Cape commands individually.
;; :bind (("C-c p d" . cape-dabbrev)
;; ("C-c p h" . cape-history)
;; ("C-c p f" . cape-file)
;; ...)
:init
;; Add to the global default value of `completion-at-point-functions' which is
;; used by `completion-at-point'. The order of the functions matters, the
;; first function returning a result wins. Note that the list of buffer-local
;; completion functions takes precedence over the global list.
(add-hook 'completion-at-point-functions #'cape-dabbrev)
(add-hook 'completion-at-point-functions #'cape-file)
(add-hook 'completion-at-point-functions #'cape-elisp-block)
(add-hook 'completion-at-point-functions #'cape-dict)
;; https://sourcegraph.com/github.com/MiniApollo/kickstart.emacs/-/blob/init.el?L320-322
;;(add-to-list 'completion-at-point-functions #'cape-dict) ;; Dictionary completion
;; ...
:config ;; start emacs ; C-x b *scratch* ; start typing ; C-c p w should suggest from dictionary now
(setq cape-dict-file "C:/Users/user/wordlist-german.txt") ;
(setq cape-dict-grep nil)
;; https://github.com/minad/cape/blob/ae98ec2/README.org#capf-transformers
;; Example 3: Create a Capf with debugging messages
;;(setq-local completion-at-point-functions (list (cape-capf-debug #'cape-dict)))
;; https://github.com/minad/cape?tab=readme-ov-file#super-capf---merging-multiple-capfs
;; Merge the dabbrev, dict and keyword capfs, display candidates together.
;;(setq-local completion-at-point-functions
;; (list (cape-capf-super #'cape-dabbrev #'cape-dict #'cape-keyword)))
;; Alternative: Define named Capf instead of using the anonymous Capf directly
(defun cape-dabbrev-dict-keyword ()
(cape-wrap-super #'cape-dabbrev #'cape-dict #'cape-keyword))
(setq-local completion-at-point-functions (list #'cape-dabbrev-dict-keyword))
)
;; FROM HERE: https://blog.binchen.org/posts/what-s-the-best-spell-check-set-up-in-emacs/
;; find aspell and hunspell automatically
;; (cond
;; ;; try hunspell at first
;; ;; if hunspell does NOT exist, use aspell
;; (
;; ;; C:\Users\user\AppData\Local\Microsoft\WinGet\Packages\FSFhu.Hunspell_Microsoft.Winget.Source_8wekyb3d8bbwe\hunspell.exe
;; (setq-default ispell-program-name (executable-find "hunspell.exe"))
;; ;; https://emacs.stackexchange.com/q/14056
;; ;; https://stackoverflow.com/a/4111580/3320256
;; (eval-after-load "ispell"
;; '(progn (defun ispell-get-coding-system () 'utf-8)))
;; ;; leider iso-latin, bräuchte utf-8-unix
;; (setq ispell-hunspell-dict-paths-alist '(("de_DE" "C:/Users/dkahlenberg/de_DE.aff")))
;; (setq ispell-local-dictionary "de_DE")
;; (setq ispell-local-dictionary-alist
;; ;; Please note the list `("-d" "en_US")` contains ACTUAL parameters passed to hunspell
;; ;; You could use `("-d" "en_US,en_US-med")` to check with multiple dictionaries
;; '(("de_DE" "[[:alpha:]]" "[^[:alpha:]]" "[']" nil ("-d" "de_DE") nil utf-8)))
;; ;; new variable `ispell-hunspell-dictionary-alist' is defined in Emacs
;; ;; If it's nil, Emacs tries to automatically set up the dictionaries.
;; (when (boundp 'ispell-hunspell-dictionary-alist)
;; (setq ispell-hunspell-dictionary-alist ispell-local-dictionary-alist)))
;; (
;; (setq-default ispell-program-name (executable-find "aspell.exe"))
;; ;; Please note ispell-extra-args contains ACTUAL parameters passed to aspell
;; (setq ispell-extra-args '("--sug-mode=ultra" "--lang=de_DE"))))
(setq deft-extensions '("org")
deft-directory "~/../../Documents/zettelkasten"
deft-recursive t
;; deft-new-file-format "%Y-%m-%dT%H%M"
;; deft-use-filename-as-title t
;; I tend to write org-mode titles with #+title: (i.e., uncapitalized). Also other org-mode code at the beginning is written in lower case.
;; In order to filter these from the deft summary, let’s alter the regular expression:
deft-strip-summary-regexp
(concat "\\("
"[\n\t]" ;; blank
"\\|^#\\+[a-zA-Z_]+:.*$" ;;org-mode metadata
"\\)")
;; Its original value was \\([\n ]\\|^#\\+[[:upper:]_]+:.*$\\).
deft-default-extension "org")
(setq deft-file-naming-rules
'((noslash . "-")
(nospace . "-")
(case-fn . downcase)))
(zetteldeft-set-classic-keybindings)
(setq org-todo-keywords
'((sequence "TODO" "FEEDBACK" "VERIFY" "|" "DONE" "DELEGATED")))
;; next line is an exception, melpa did not find atomic-chrome, so
;; I used the instructions as listed in this fork
;; https://github.com/KarimAziev/atomic-chrome?#manual-installation
;; ----(add-to-list 'load-path "~/.emacs.d/atomic-chrome/")
;; ----(require 'atomic-chrome)
;; ----(atomic-chrome-start-server)
;;(setq atomic-chrome-buffer-open-style 'frame)
;; (setq atomic-chrome-server-ghost-text-port 4002)
;;(setq atomic-chrome-extension-type-list '(atomic-chrome))
;;(setq atomic-chrome-default-major-mode 'jira-markup-mode)
(defun my/jira-markup-insert-code-block ()
"Insert markup for a block of code.
If Transient Mark mode is on and a region is active, it is made a
code block."
(interactive)
(insert "{code}\n" "\n{code}\n")
(backward-char 8))
(defun my/jira-markup-insert-table-header ()
"Insert markup for a table header."
(interactive)
(insert "||" "||")
(backward-char 2))
(defun my/jira-markup-insert-pipechar ()
"Insert a |."
(interactive)
(insert "|")
(backward-char 0))
(defun my/jira-markup-insert-table-cell ()
"Insert markup for a table cell."
(interactive)
(insert "|" "|")
(backward-char 1))
(defun my/jira-markup-insert-url ()
"Insert markup for an url link."
(interactive)
(insert "[" "|]")
(backward-char 2))
(defun my/jira-markup-insert-ticket-link ()
"Insert markup for a table cell."
(interactive)
(insert "[MGM-" "]")
(backward-char 1))
(defun my/jira-markup-insert-bareurl ()
"Insert markup for an url link without alt."
(interactive)
(insert "[" "]")
(backward-char 1))
;;(eval-after-load 'atomic-chrome-edit-mode
;; (define-key atomic-chrome-edit-mode-map (kbd "C-c s") 'my/jira-markup-insert-code-block))
;;(eval-after-load 'atomic-chrome-edit-mode
;; (define-key atomic-chrome-edit-mode-map (kbd "||") 'my/jira-markup-insert-table-header))
;;(eval-after-load 'atomic-chrome-edit-mode
;; (define-key atomic-chrome-edit-mode-map (kbd "| SPC") 'my/jira-markup-insert-pipechar))
;;(eval-after-load 'atomic-chrome-edit-mode
;; (define-key atomic-chrome-edit-mode-map (kbd "|\\") 'my/jira-markup-insert-table-cell))
;;(eval-after-load 'atomic-chrome-edit-mode
;; (define-key atomic-chrome-edit-mode-map (kbd "[ SPC") 'my/jira-markup-insert-url))
;;(eval-after-load 'atomic-chrome-edit-mode
;; (define-key atomic-chrome-edit-mode-map (kbd "[ M\\") 'my/jira-markup-insert-ticket-link))
;;(eval-after-load 'atomic-chrome-edit-mode
;; (define-key atomic-chrome-edit-mode-map (kbd "[]") 'my/jira-markup-insert-bareurl))
@573
Copy link
Author

573 commented Mar 14, 2025

This is the windows compatible version
https://codeload.github.com/minad/cape/zip/cc9cf45c651ad4784444cfdbdf238ac18dce39c9, keep only "cape-char.el" "cape-keyword.el" "cape.el"

add this as cape-pkg.el in unpacked folder here ~\AppData\Roaming\.emacs.d\elpa\cape-20250308.1213:

;; -*- no-byte-compile: t; lexical-binding: nil -*-
(define-package "cape" "20250308.1213"
  "Completion At Point Extensions."
  '((emacs  "28.1")
    (compat "30"))
  :url "https://github.com/minad/cape"
  :commit "cc9cf45c651ad4784444cfdbdf238ac18dce39c9"
  :revdesc "cc9cf45c651a"
  :keywords '("abbrev" "convenience" "matching" "completion" "text")
  :authors '(("Daniel Mendler" . "[email protected]"))
  :maintainers '(("Daniel Mendler" . "[email protected]")))

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment