elvish-str(7) | Miscellaneous Information Manual | elvish-str(7) |
The str: module provides string manipulation functions.
Function usages are given in the same format as in the reference doc for the builtin module.
str:compare $a $b
Compares two strings and output an integer that will be 0 if a == b, -1 if a < b, and +1 if a > b.
~> str:compare a a ▶ 0 ~> str:compare a b ▶ -1 ~> str:compare b a ▶ 1
str:contains $str $substr
Outputs whether $str contains $substr as a substring.
~> str:contains abcd x ▶ $false ~> str:contains abcd bc ▶ $true
str:contains-any $str $chars
Outputs whether $str contains any Unicode code points in $chars.
~> str:contains-any abcd x ▶ $false ~> str:contains-any abcd xby ▶ $true
str:count $str $substr
Outputs the number of non-overlapping instances of $substr in $s. If $substr is an empty string, output 1 + the number of Unicode code points in $s.
~> str:count abcdefabcdef bc ▶ 2 ~> str:count abcdef '' ▶ 7
str:equal-fold $str1 $str2
Outputs if $str1 and $str2, interpreted as UTF-8 strings, are equal under Unicode case-folding.
~> str:equal-fold ABC abc ▶ $true ~> str:equal-fold abc ab ▶ $false
str:from-codepoints $number...
Outputs a string consisting of the given Unicode codepoints. Example:
~> str:from-codepoints 0x61 ▶ a ~> str:from-codepoints 0x4f60 0x597d ▶ 你好
@cf str:to-codepoints
str:from-utf8-bytes $number...
Outputs a string consisting of the given Unicode bytes. Example:
~> str:from-utf8-bytes 0x61 ▶ a ~> str:from-utf8-bytes 0xe4 0xbd 0xa0 0xe5 0xa5 0xbd ▶ 你好
@cf str:to-utf8-bytes
str:has-prefix $str $prefix
Outputs if $str begins with $prefix.
~> str:has-prefix abc ab ▶ $true ~> str:has-prefix abc bc ▶ $false
str:has-suffix $str $suffix
Outputs if $str ends with $suffix.
~> str:has-suffix abc ab ▶ $false ~> str:has-suffix abc bc ▶ $true
str:index $str $substr
Outputs the index of the first instance of $substr in $str, or -1 if $substr is not present in $str.
~> str:index abcd cd ▶ 2 ~> str:index abcd xyz ▶ -1
str:index-any $str $chars
Outputs the index of the first instance of any Unicode code point from $chars in $str, or -1 if no Unicode code point from $chars is present in $str.
~> str:index-any "chicken" "aeiouy" ▶ 2 ~> str:index-any l33t aeiouy ▶ -1
str:join $sep $input-list?
Joins inputs with $sep. Examples:
~> put lorem ipsum | str:join , ▶ lorem,ipsum ~> str:join , [lorem ipsum] ▶ lorem,ipsum ~> str:join '' [lorem ipsum] ▶ loremipsum ~> str:join '...' [lorem ipsum] ▶ lorem...ipsum
Etymology: Various languages, Python (https://docs.python.org/3.6/library/stdtypes.html#str.join).
@cf str:split
str:last-index $str $substr
Outputs the index of the last instance of $substr in $str, or -1 if $substr is not present in $str.
~> str:last-index "elven speak elvish" elv ▶ 12 ~> str:last-index "elven speak elvish" romulan ▶ -1
str:replace &max=-1 $old $repl $source
Replaces all occurrences of $old with $repl in $source. If $max is non-negative, it determines the max number of substitutions.
Note: This command does not support searching by regular expressions, $old is always interpreted as a plain string. Use re:replace if you need to search by regex.
str:split $sep $string
Splits $string by $sep. If $sep is an empty string, split it into codepoints.
~> str:split , lorem,ipsum ▶ lorem ▶ ipsum ~> str:split '' 你好 ▶ 你 ▶ 好
Note: This command does not support splitting by regular expressions, $sep is always interpreted as a plain string. Use re:split if you need to split by regex.
Etymology: Various languages, in particular Python (https://docs.python.org/3.6/library/stdtypes.html#str.split).
@cf str:join
str:title $str
Outputs $str with all Unicode letters that begin words mapped to their Unicode title case.
~> str:title "her royal highness" ▶ Her Royal Highness
str:to-codepoints $string
Outputs value of each codepoint in $string, in hexadecimal. Examples:
~> str:to-codepoints a ▶ 0x61 ~> str:to-codepoints 你好 ▶ 0x4f60 ▶ 0x597d
The output format is subject to change.
@cf from-codepoints
str:to-lower $str
Outputs $str with all Unicode letters mapped to their lower-case equivalent.
~> str:to-lower 'ABC!123' ▶ abc!123
str:to-title $str
Outputs $str with all Unicode letters mapped to their Unicode title case.
~> str:to-title "her royal highness" ▶ HER ROYAL HIGHNESS ~> str:to-title "хлеб" ▶ ХЛЕБ
str:to-upper
Outputs $str with all Unicode letters mapped to their upper-case equivalent.
~> str:to-upper 'abc!123' ▶ ABC!123
str:to-utf8-bytes $string
Outputs value of each byte in $string, in hexadecimal. Examples:
~> str:to-utf8-bytes a ▶ 0x61 ~> str:to-utf8-bytes 你好 ▶ 0xe4 ▶ 0xbd ▶ 0xa0 ▶ 0xe5 ▶ 0xa5 ▶ 0xbd
The output format is subject to change.
@cf from-utf8-bytes
str:trim $str $cutset
Outputs $str with all leading and trailing Unicode code points contained in $cutset removed.
~> str:trim "¡¡¡Hello, Elven!!!" "!¡" ▶ 'Hello, Elven'
str:trim-left $str $cutset
Outputs $str with all leading Unicode code points contained in $cutset removed. To remove a prefix string use str:trim-prefix.
~> str:trim-left "¡¡¡Hello, Elven!!!" "!¡" ▶ 'Hello, Elven!!!'
str:trim-prefix $str $prefix
Outputs $str minus the leading $prefix string. If $str doesn’t begin with $prefix, $str is output unchanged.
~> str:trim-prefix "¡¡¡Hello, Elven!!!" "¡¡¡Hello, " ▶ Elven!!! ~> str:trim-prefix "¡¡¡Hello, Elven!!!" "¡¡¡Hola, " ▶ '¡¡¡Hello, Elven!!!'
str:trim-right $str $cutset
Outputs $str with all leading Unicode code points contained in $cutset removed. To remove a suffix string use str:trim-suffix.
~> str:trim-right "¡¡¡Hello, Elven!!!" "!¡" ▶ '¡¡¡Hello, Elven'
str:trim-space $str
Outputs $str with all leading and trailing white space removed as defined by Unicode.
~> str:trim-space " \t\n Hello, Elven \n\t\r\n" ▶ 'Hello, Elven'
str:trim-suffix $str $suffix
Outputs $str minus the trailing $suffix string. If $str doesn’t end with $suffix, $str is output unchanged.
~> str:trim-suffix "¡¡¡Hello, Elven!!!" ", Elven!!!" ▶ ¡¡¡Hello ~> str:trim-suffix "¡¡¡Hello, Elven!!!" ", Klingons!!!" ▶ '¡¡¡Hello, Elven!!!'
July 18, 2021 | Elvish 0.15.0 |