BethanyG BethanyG

Grapheme tokenisation in Python

When working with tokenisation and break iterators, it is sometimes necessary to work at the character, syllable, line, or sentence levels. Character level tokenisation is an interesting case. By character, I mean a user perceivable unit of text, which the Unicode standard would refer to as a grapheme. The usual way I see developers handling character level tokenisation of English is via list comprehension or typecasting a string to a list:

>>> t1 = "transformation"
>>> [char for char in t1]
['t', 'r', 'a', 'n', 's', 'f', 'o', 'r', 'm', 'a', 't', 'i', 'o', 'n']

get github stats

https://github.com/anuraghazra/github-readme-stats

The Zoom install package for macOS is mad. Rather than actually using the installer to install things, it does everything in the preinstall script. That's bonkers, and also means that the system won't have a list of the files it installed, because it's doing it using shell script.

The script appears to install two items, namely:

/Applications/zoom.us.app
~/Library/Internet Plug-Ins/ZoomUsPlugIn.plugin

If the user opening the package isn't an administrator, it looks like it will install the app in the user's home folder instead. If they are an administrator, Zoom will delete the ZoomUsPlugIn.plugin from /Library if it's there, but it still installs to ~/Library.

It also adds Zoom to your Dock automatically, without asking.

Python `@property` inheritance the right way

Given a Parent class with value property, Child can inherit and overload the property while accessing Parent property getter and setter.

Although we could just reimplement the Child.value property logic completely without using Parent.value whatsover, this would violate the DRY principle and, more important, it wouldn't allow for proper multiple inheritance (as show in the example property_inheritance.py bellow).

Two options:

Child redefines value property completely, both getter and setter.

Markdown - Resize pictures in GitHub

I found that the "best" way is to use HTML, as it works both in Readme/.md files and also in comments (within Issues, Gist...)

E.g. when adding/editing a comment (within Issues, Gist...) :

Upload the picture by drag-and-drop in the text field
replace ![image](https://your-image-url.type) with <img src="https://your-image-url.type" width="100" height="100">

As mentioned by @cbestow (thanks!), it's not mandatory to set both width and height. If only one is set, the other will be adjusted accordingly to preserve the aspect ratio of the image.

	"""
	Script to print all Unicode flag emoji are also a valid flag when reversed.

	Output of this script:

	🇦🇬 (Antigua and Barbuda) reverses to 🇬🇦 (Gabon)
	🇦🇱 (Albania) reverses to 🇱🇦 (Lao People's Democratic Republic)
	🇦🇲 (Armenia) reverses to 🇲🇦 (Morocco)
	🇦🇶 (Antarctica) reverses to 🇶🇦 (Qatar)
	🇦🇸 (American Samoa) reverses to 🇸🇦 (Saudi Arabia)

	# .style.yapf
	#
	# DESCRIPTION
	# Configuration file for the python formatter yapf.
	#
	# This configuration is based on the generic
	# configuration published on GitHub.
	#
	# AUTHOR
	# krnd

BethanyG BethanyG

Grapheme tokenisation in Python

get github stats

Python @property inheritance the right way

Markdown - Resize pictures in GitHub

Python `@property` inheritance the right way