NKF

Guide to using nkf in containers

Uploaded by

codigolibrecol

nkf(1)

NAME
nkf - Network Kanji Filter

SYNOPSIS
nkf [-butjnesliohrTVvwWJESZxXFfmMBOcdILg] [file ...]

DESCRIPTION
nkf is yet another kanji code converter among networks, hosts and
terminals. It converts the input kanji code to a designated kanji code
such as ISO-2022-JP, Shift_JIS, EUC-JP, UTF-8, UTF-16 or UTF-32.
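
The kind of conversion described above can be illustrated without nkf. The following sketch is not nkf's implementation; it assumes the codecs shipped with Python's standard library and re-encodes the same text into several of the encodings named above. The helper name convert is hypothetical.

```python
# Illustration only: Python's built-in codecs, not nkf itself.
TEXT = "日本語"

def convert(data: bytes, src: str, dst: str) -> bytes:
    """Decode bytes from the src encoding and re-encode them as dst."""
    return data.decode(src).encode(dst)

sjis = TEXT.encode("shift_jis")
for name in ("iso2022_jp", "euc_jp", "utf-8", "utf-16-be", "utf-32-be"):
    out = convert(sjis, "shift_jis", name)
    # Show how the byte length differs from one encoding to the next.
    print(f"{name:12} {len(out):2d} bytes  {out.hex(' ')}")
```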

One of nkf's most distinctive features is that it guesses the input kanji
encoding. It currently recognizes ISO-2022-JP, Shift_JIS, EUC-JP, UTF-8,
UTF-16 and UTF-32, so users need not set the input kanji code explicitly.
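
How nkf guesses is not described here. As a rough idea of what encoding detection involves, the sketch below tries a fixed list of Python codecs in order and reports the first one that decodes the input without error. The candidate order and the name guess_encoding are assumptions of this example, not nkf's algorithm.

```python
# Illustration only: a naive guesser, not nkf's detection algorithm.
# ISO-2022-JP is tried first because it is 7-bit and would otherwise
# be accepted by the UTF-8 decoder as plain ASCII with escape bytes.
CANDIDATES = ("iso2022_jp", "utf-8", "euc_jp", "shift_jis")

def guess_encoding(data: bytes):
    """Return the first candidate codec that decodes data without error."""
    for name in CANDIDATES:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            continue
        return name
    return None
```

Short inputs are often valid in more than one encoding, so a guesser of this kind can be wrong; pure ASCII input, for example, is reported as the first candidate.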

By default, X0201 kana is converted into X0208 kana. For X0201 kana, the
SO/SI, SSO and ESC-(-I methods are supported. For automatic code
detection, nkf assumes no X0201 kana in Shift_JIS. To accept X0201 in
Shift_JIS, use -X, -x or -S.

Multiple options are specified as separate strings, such as

print nkf('--ic=UTF8-MAC', '-w', $string), "\n";

where every argument except the last is an option.

OPTIONS
-J -S -E -W -W16 -W32 -j -s -e -w -w16 -w32
Specify input and output encodings. Upper case is input. cf. --ic and
--oc.

-J ISO-2022-JP (JIS code).

-S Shift_JIS and JIS X 0201 kana. EUC-JP is recognized as X0201 kana.

Without the -x flag, JIS X 0201 Katakana (a.k.a. halfwidth kana) is
converted into JIS X 0208. If you use Windows, see Windows-31J (CP932).

-E EUC-JP.

-W UTF-8N.

-W16[BL][0]
UTF-16. B or L selects Big Endian or Little Endian. 0 gives whether to
put a BOM or not.

-W32[BL][0]
UTF-32. B or L selects Big Endian or Little Endian. 0 gives whether to
put a BOM or not.
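
Byte order and the BOM can be seen directly in the encoded bytes. The sketch below uses Python's codecs module (an assumption of this example; it does not call nkf) to show both byte orders of UTF-16 and a reader that honors a leading BOM. The name decode_utf16 is hypothetical.

```python
import codecs

text = "A"
be = text.encode("utf-16-be")          # big endian, no BOM
le = text.encode("utf-16-le")          # little endian, no BOM
be_bom = codecs.BOM_UTF16_BE + be      # FE FF prefix
le_bom = codecs.BOM_UTF16_LE + le      # FF FE prefix

def decode_utf16(data: bytes, default: str = "utf-16-be") -> str:
    """Honor a leading BOM if present, else fall back to a default byte order."""
    if data.startswith(codecs.BOM_UTF16_BE):
        return data[2:].decode("utf-16-be")
    if data.startswith(codecs.BOM_UTF16_LE):
        return data[2:].decode("utf-16-le")
    return data.decode(default)
```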

-b -u
Output is buffered with -b (DEFAULT) and unbuffered with -u.

-t No conversion.

-i[@B]
Specify the escape sequence for JIS X 0208.

-i@ Use ESC $ @. (JIS X 0208-1978)

-iB Use ESC $ B. (JIS X 0208-1983/1990 DEFAULT)

-o[BJ]
Specify the escape sequence for US-ASCII/JIS X 0201 Roman. (DEFAULT B)

-r {de/en}crypt ROT13/47
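
ROT13 and ROT47 are both self-inverse rotations, which is why one option serves for encrypting and decrypting. The sketch below shows the two rotations on ASCII text in Python. Which characters nkf applies each rotation to is not stated here, so treat this as the generic definitions only; the function names are hypothetical.

```python
import codecs

def rot13(text: str) -> str:
    """Rotate ASCII letters by 13 places; other characters are unchanged."""
    return codecs.encode(text, "rot_13")

def rot47(text: str) -> str:
    """Rotate the 94 printable ASCII characters '!'..'~' by 47 places."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 33 <= cp <= 126:
            cp = 33 + (cp - 33 + 47) % 94
        out.append(chr(cp))
    return "".join(out)
```

Applying either function twice returns the original text.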

-h[123] --hiragana --katakana --katakana-hiragana

-h1 --hiragana
Katakana to Hiragana conversion.

-h2 --katakana
Hiragana to Katakana conversion.

-h3 --katakana-hiragana
Katakana to Hiragana and Hiragana to Katakana conversion.
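
For text already in Unicode, the three modes amount to shifting code points between the Hiragana and Katakana blocks, which lie 0x60 apart. The sketch below is a Python illustration under that assumption, not nkf's code; the function name kana and its mode argument are hypothetical and simply mirror -h1, -h2 and -h3.

```python
OFFSET = 0x60  # distance between the Hiragana and Katakana blocks in Unicode

KATA_TO_HIRA = {cp: cp - OFFSET for cp in range(0x30A1, 0x30F7)}
HIRA_TO_KATA = {cp: cp + OFFSET for cp in range(0x3041, 0x3097)}
SWAP = {**KATA_TO_HIRA, **HIRA_TO_KATA}

def kana(text: str, mode: int) -> str:
    """mode 1: Katakana to Hiragana, 2: Hiragana to Katakana, 3: both at once."""
    table = {1: KATA_TO_HIRA, 2: HIRA_TO_KATA, 3: SWAP}[mode]
    return text.translate(table)
```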

-T Text mode output (MS-DOS)

-f[m[-n]]
Folding on m length with n margin in a line. Without this option, fold
length is 60 and fold margin is 10.

-F New line preserving line folding.

-Z[0-3]
Convert X0208 alphabet (Fullwidth Alphabets) to ASCII.

-Z -Z0
Convert X0208 alphabet to ASCII.

-Z1 Convert X0208 kankaku (the fullwidth space) to a single ASCII space.

-Z2 Convert X0208 kankaku (the fullwidth space) to two ASCII spaces.

-Z3 Replace fullwidth >, <, ", & with '&gt;', '&lt;', '&quot;',
'&amp;' as in HTML.
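
In Unicode terms, the fullwidth forms U+FF01 to U+FF5E sit exactly 0xFEE0 above their ASCII counterparts, and the fullwidth space is U+3000. The sketch below uses that fact to imitate the four modes in Python. It is an approximation: the exact set of X0208 characters nkf converts, and whether -Z3 also performs the -Z0 conversion, are assumptions of this example. The name z_convert is hypothetical.

```python
FULLWIDTH_TO_ASCII = {cp: cp - 0xFEE0 for cp in range(0xFF01, 0xFF5F)}
HTML_ENTITIES = {"＞": "&gt;", "＜": "&lt;", "＂": "&quot;", "＆": "&amp;"}

def z_convert(text: str, mode: int = 0) -> str:
    """Imitate -Z0 to -Z3 on a Unicode string."""
    table = dict(FULLWIDTH_TO_ASCII)
    if mode == 1:
        table[0x3000] = " "       # fullwidth space to one ASCII space
    elif mode == 2:
        table[0x3000] = "  "      # fullwidth space to two ASCII spaces
    elif mode == 3:
        for ch, entity in HTML_ENTITIES.items():
            table[ord(ch)] = entity
    return text.translate(table)
```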

-X -x
With -X or without this option, X0201 kana is converted into X0208 kana.
With -x, nkf tries to preserve X0201 kana and does not convert it to
X0208. In JIS output, ESC-(-I is used. In EUC output, SS2 is used.
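
The difference between the two kana sets is easy to see in Unicode, where X0201 kana corresponds to the halfwidth forms U+FF61 to U+FF9F. The sketch below widens only those characters using NFKC normalization, which also merges a halfwidth voiced mark into the preceding kana. This is a Python illustration under those assumptions, not nkf's conversion table; widen_kana is a hypothetical name.

```python
import re
import unicodedata

HALFWIDTH_KANA = re.compile("[\uFF61-\uFF9F]+")

def widen_kana(text: str) -> str:
    """Replace runs of halfwidth kana with their fullwidth equivalents."""
    return HALFWIDTH_KANA.sub(
        lambda m: unicodedata.normalize("NFKC", m.group()), text
    )

# Preserved halfwidth kana is a single byte in Shift_JIS and is
# introduced by SS2 (0x8E) in EUC-JP.
sjis_ka = "ｶ".encode("shift_jis")
euc_ka = "ｶ".encode("euc_jp")
```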

-B[0-2]
Assume broken JIS-Kanji input, which lost ESC. Useful when your site is
using old B-News Nihongo patch.

-B1 allows any chars after ESC-( or ESC-$.

-B2 forces ASCII after NL.

-I Replace non-ISO-2022-JP characters with a geta character (the
substitute character in Japanese).

-m[BQN0]
MIME ISO-2022-JP/ISO8859-1 decode (DEFAULT). To see ISO8859-1 (Latin-1),
-l is necessary.

-mB Decode MIME base64 encoded stream. Remove header or other part
before conversion.

-mQ Decode MIME quoted stream. '_' in quoted stream is converted to
space.

-mN Non-strict decoding. It allows line break in the middle of the
base64 encoding.

-m0 No MIME decode.
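
What MIME header decoding produces can be reproduced with Python's email.header module. The sketch below is an illustration with the standard library, not nkf's decoder; it decodes one Q-encoded and one Base64-encoded word, and shows '_' becoming a space in the Q form. The name decode_mime_header is hypothetical.

```python
from email.header import decode_header

def decode_mime_header(value: str) -> str:
    """Decode RFC 2047 encoded words in a header value to text."""
    parts = []
    for data, charset in decode_header(value):
        if isinstance(data, bytes):
            parts.append(data.decode(charset or "ascii"))
        else:
            parts.append(data)
    return "".join(parts)

print(decode_mime_header("=?ISO-8859-1?Q?caf=E9_au_lait?="))
print(decode_mime_header("=?ISO-2022-JP?B?GyRCRnxLXDhsGyhC?="))
```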

-M MIME encode. Header style. All ASCII code and control characters are
intact.

-MB MIME encode Base64 stream. Kanji conversion is performed before
encoding, so this cannot be used as a picture encoder.

-MQ Perform quoted encoding.
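
The header-style encoding can likewise be illustrated with Python's email.header module. The sketch below is not nkf's encoder and its output may differ from nkf's in letter case or line splitting; it only shows that the text is first converted to ISO-2022-JP and then wrapped as an encoded word.

```python
from email.header import Header, decode_header

# Convert to ISO-2022-JP, then wrap the bytes as an RFC 2047 encoded word.
encoded = Header("日本語", charset="iso-2022-jp").encode()
print(encoded)

# Decoding the encoded word gives back the original text.
raw, charset = decode_header(encoded)[0]
print(raw.decode(charset))
```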

-l Input and output code is ISO8859-1 (Latin-1) and ISO-2022-JP. -s, -e
and -x are not compatible with this option.

-L[uwm] -d -c
Convert line breaks.

-Lu -d
unix (LF)

-Lw -c
windows (CRLF)

-Lm mac (CR)

Without this option, nkf doesn't convert line breaks.
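
Line break conversion is a byte-level substitution. The sketch below is a Python illustration, not nkf's code; it maps every CRLF, CR or LF to the requested style, keyed by the same letters as -Lu, -Lw and -Lm. The name convert_newlines is hypothetical.

```python
import re

NEWLINES = {"u": b"\n", "w": b"\r\n", "m": b"\r"}
# CRLF is listed first so that it is matched as one line break, not two.
ANY_NEWLINE = re.compile(rb"\r\n|\r|\n")

def convert_newlines(data: bytes, style: str) -> bytes:
    """Rewrite every line break in data as unix, windows or mac style."""
    return ANY_NEWLINE.sub(NEWLINES[style], data)
```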

--fj --unix --mac --msdos --windows

Convert for these systems.

--jis --euc --sjis --mime --base64

Convert to named code.

--jis-input --euc-input --sjis-input --mime-input --base64-input

Assume input system

--ic=input codeset --oc=output codeset

Set the input or output codeset. nkf supports the following codesets;
codeset names are case insensitive.

ISO-2022-JP
a.k.a. RFC1468, 7bit JIS, JUNET

EUC-JP (eucJP-nkf)
a.k.a. AT&T JIS, Japanese EUC, UJIS

eucJP-ascii

eucJP-ms

CP51932
Microsoft Version of EUC-JP.

Shift_JIS
a.k.a. SJIS, MS_Kanji

Windows-31J
a.k.a. CP932

UTF-8
same as UTF-8N

UTF-8N
UTF-8 without BOM

UTF-8-BOM
UTF-8 with BOM

UTF8-MAC (input only)
decomposed UTF-8

UTF-16
same as UTF-16BE

UTF-16BE
UTF-16 Big Endian without BOM

UTF-16BE-BOM
UTF-16 Big Endian with BOM

UTF-16LE
UTF-16 Little Endian without BOM

UTF-16LE-BOM
UTF-16 Little Endian with BOM

UTF-32
same as UTF-32BE

UTF-32BE
UTF-32 Big Endian without BOM

UTF-32BE-BOM
UTF-32 Big Endian with BOM

UTF-32LE
UTF-32 Little Endian without BOM

UTF-32LE-BOM
UTF-32 Little Endian with BOM
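
"Decomposed UTF-8" means that a character such as が is stored as a base character followed by a combining mark. The sketch below shows the difference with Python's unicodedata module. It uses NFC normalization as a stand-in; the UTF8-MAC form is close to, but not identical with, standard NFD, so this is an approximation and not nkf's conversion.

```python
import unicodedata

decomposed = "か\u3099"   # KA followed by the combining voiced sound mark
composed = unicodedata.normalize("NFC", decomposed)

# The two forms look the same on screen but differ in length and bytes.
print(len(decomposed), len(decomposed.encode("utf-8")))
print(len(composed), len(composed.encode("utf-8")))
```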

--fb-{skip, html, xml, perl, java, subchar}

Specify the way that nkf handles unassigned characters. Without this
option, --fb-skip is assumed.
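
Python's codecs offer a comparable choice of fallbacks through the errors argument, which makes the idea easy to try out. The sketch below is an analogy only: the exact text nkf emits for each --fb-* variant is not described here, and the mapping of Python error handlers to nkf options is not claimed.

```python
TEXT = "A€B"   # the euro sign has no Shift_JIS code

for errors in ("ignore", "replace", "xmlcharrefreplace", "backslashreplace"):
    # "ignore" drops the character, the others substitute something for it.
    print(f"{errors:18} {TEXT.encode('shift_jis', errors=errors)!r}")
```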

--prefix=<escape character><target character>..

When nkf converts to Shift_JIS, it adds the specified escape character to
the specified 2nd bytes of Shift_JIS characters. The first byte of the
argument is the escape character and the following bytes are the target
characters.
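
The usual reason for such a prefix is that some Shift_JIS characters have a 2nd byte equal to an ASCII metacharacter, for example 0x5C (backslash) in 表. The sketch below walks a Shift_JIS byte string and inserts an escape byte when the 2nd byte of a two-byte character is one of the targets. It assumes the escape byte is placed immediately before the matching 2nd byte; that placement and the name add_prefix are assumptions of this example, not taken from nkf.

```python
def add_prefix(sjis: bytes, escape: int, targets: bytes) -> bytes:
    """Insert escape before any 2nd byte of a two-byte character found in targets."""
    out = bytearray()
    i = 0
    while i < len(sjis):
        b = sjis[i]
        is_lead = 0x81 <= b <= 0x9F or 0xE0 <= b <= 0xFC
        if is_lead and i + 1 < len(sjis):
            second = sjis[i + 1]
            out.append(b)
            if second in targets:
                out.append(escape)   # assumed position of the escape byte
            out.append(second)
            i += 2
        else:
            out.append(b)            # single-byte character, left alone
            i += 1
    return bytes(out)
```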

--no-cp932ext
Handle the characters extended in CP932 as unassigned characters.
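
Python ships separate shift_jis and cp932 codecs, which gives a quick way to see which characters are CP932 extensions. The sketch below is an illustration with those codecs, not nkf's tables; can_encode is a hypothetical helper.

```python
def can_encode(text: str, codec: str) -> bool:
    """Report whether every character of text has a code in codec."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False

# The circled digit one is a CP932 extension with no plain Shift_JIS code.
for codec in ("cp932", "shift_jis"):
    print(codec, can_encode("①", codec))
```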

--no-best-fit-chars
When converting from Unicode to an encoded byte sequence, don't convert
characters that are not round-trip safe. When converting from Unicode to
Unicode, nkf can be used as a UTF converter with this option and -x. (In
other words, without this option and -x, nkf doesn't preserve some
characters.)

You should use this option when nkf converts strings that relate to
paths.
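
A well-known case of a conversion that is not round-trip safe is the wave dash. The sketch below uses Python's shift_jis and cp932 codecs, whose tables differ for this character, to show one code point going out and a different one coming back. It illustrates the hazard only and does not reproduce nkf's best-fit table; round_trips is a hypothetical helper.

```python
WAVE_DASH = "\u301c"         # 〜
FULLWIDTH_TILDE = "\uff5e"   # ～

def round_trips(text: str, encode_with: str, decode_with: str) -> bool:
    """True if text survives encoding with one table and decoding with another."""
    return text.encode(encode_with).decode(decode_with) == text

raw = WAVE_DASH.encode("shift_jis")   # encoded with the JIS X 0208 table
back = raw.decode("cp932")            # decoded with the Microsoft table
print(raw.hex(), hex(ord(back)))
```

A file name containing such a character would no longer match after the round trip, which is the concern with path strings.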

--cap-input
Decode hex encoded characters.

--url-input
Unescape percent escaped characters.

--numchar-input
Decode character reference, such as "&#....;".
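
The three input decodings can be imitated with Python's standard library. The sketch below is an illustration, not nkf's code. It assumes that the hex encoded form writes each byte as ':' followed by two hex digits; that syntax is an assumption of this example. Note also that html.unescape handles named references such as &amp; in addition to numeric ones. All three function names are hypothetical.

```python
import html
import re
from urllib.parse import unquote_to_bytes

def url_input(data: str) -> bytes:
    """Unescape percent escaped bytes."""
    return unquote_to_bytes(data)

def cap_input(data: bytes) -> bytes:
    """Decode ':XX' hex escapes (assumed syntax) into raw bytes."""
    return re.sub(
        rb":([0-9A-Fa-f]{2})",
        lambda m: bytes([int(m.group(1), 16)]),
        data,
    )

def numchar_input(data: str) -> str:
    """Decode character references such as &#26085; or &#x672C;."""
    return html.unescape(data)
```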

--in-place[=SUFFIX] --overwrite[=SUFFIX]
Overwrite original listed files by filtered result.

Note --overwrite preserves timestamps of original files.
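
The difference between rewriting a file and rewriting it while keeping its timestamps can be sketched in Python as follows. This is not nkf's implementation: the function name convert_in_place and its keep_times flag are hypothetical, and the SUFFIX argument is not modeled because its meaning is not described here.

```python
import os
import tempfile

def convert_in_place(path, src, dst, keep_times=False):
    """Rewrite path from the src encoding to dst, optionally keeping its timestamps."""
    before = os.stat(path)
    with open(path, "rb") as f:
        converted = f.read().decode(src).encode(dst)
    # Write to a temporary file in the same directory, then swap it in,
    # so that a failure cannot leave a half-written original behind.
    fd, tmp = tempfile.mkstemp(dir=os.path.dirname(os.path.abspath(path)))
    with os.fdopen(fd, "wb") as f:
        f.write(converted)
    os.replace(tmp, path)
    if keep_times:
        os.utime(path, ns=(before.st_atime_ns, before.st_mtime_ns))
```

The temporary file is created with restrictive permissions, so a fuller version would also copy the original file mode.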

--guess=[12]
Print guessed encoding and newline. (2 is default, 1 is only encoding)
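
Guessing the newline style is a matter of counting the three kinds of line break. The sketch below is a Python illustration, not nkf's code; the labels it returns and the name detect_newline are assumptions of this example and are not nkf's output format.

```python
def detect_newline(data: bytes):
    """Return "CRLF", "CR", "LF", "MIXED", or None if there is no line break."""
    crlf = data.count(b"\r\n")
    cr = data.count(b"\r") - crlf    # bare CR, not part of a CRLF pair
    lf = data.count(b"\n") - crlf    # bare LF, not part of a CRLF pair
    found = [name for name, n in (("CRLF", crlf), ("CR", cr), ("LF", lf)) if n]
    if not found:
        return None
    return found[0] if len(found) == 1 else "MIXED"
```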

--help
Print nkf's help.

--version
Print nkf's version.

-- Ignore rest of -option.

nkf 2.1.5 2018-12-15


Copyright (c) 1996-2018, The nkf Project.
