The last post mentioned other groups' projects with dictionaries and language resources. I thought I'd mention some of the projects I've been working on in this area.

I've been creating build scripts with the LM BLD project ( http://www.distasis.com/cpp/lmbld.htm ) so that I'll have automated, repeatable steps to build programs, libraries and other types of packages. Here are some of things I've been working on.

The Moby project is a very nice dictionary resource. Using their thesaurus, I was able to create a word list and a simple dictionary in stardict format. I use it with Open Source programs like scramble.

The Strongs concordance is in the public domain. I've created a translation dictionary in stardict format with it.

I happen to like the stardict dictionary format. There are several nice programs that can work with that format. I wanted something lightweight that would work well on older systems or let me create my own GUI interfaces. The closest thing I could find to what I wanted was sdcv. However, there were a few issues I had with it. The biggest is that it requires glib as a dependency and I didn't want to install GTK+ related dependencies on my systems. The second issue I ran into was that it couldn't handle some of the newer versions of the stardict format. Since the code is GNU GPL licensed, I started with it and made several modifications and customizations. The result is sdcv2 which can be linked to my own Unicode shared libraries in place of glib if desired and can work with dictionaries in more recent stardict formats. It may not make use of all the latest features in the newer formats, but it can at least access information from them.

I've seen other projects that use the sdcv library as a back-end and create their own GUI for a dictionary program. It makes sense if the program uses GTK+, but it seems awkward for Qt or other GUI programs to require GTK+ related dependencies. With sdcv2, there are no GTK+ related dependencies.

I would love to find a dictionary with a FLTK GUI, especially if it can handle stardict format. Since, I haven't been able to find one, I may try to write one at some point. I've also been thinking about creating a pdcurses front end. When I use sdcv (or sdcv2) from the command line, certain systems like Windows can't handle input or output of certain Unicode characters correctly. I've added support for SDL 2.x, SDL2_ttf and the ability to work with a range of Unicode characters within the UCS-2 character set to pdcurses. I think pdcurses would make an interesting front end for a program using the sdcv2 library. It would work on any system that supports SDL 1.x or 2.x, including more unusual operating systems like Syllable and Haiku. Would like to hear from others who may be interested in or are working on similar projects.

The dictzip program compresses dictionary files. It uses an extension to the gzip format with extra fields to include information about the compressed dictionary. Files compressed with this format often use the .dz extension. You can use dictzip with stardict files to save space. dictzip is primarily a POSIX compliant program, so it doesn't convert well to certain systems. I was able to find a Windows port that limited the program's functionality, but did enough to get the job of compression done. I've made some modifications to it and am using it as a cross-platform method of compressing stardict dictionary files.

Several utilities and conversion programs were created for stardict in the stardict-tools project. Similar to stardict and sdcv, glib is a dependency for stardict-tools. There are a few tools that use a GTK+ front-end as well. I personally only use the stardict-tools to convert tab delimited files and files in babylon format to stardict. So, I modified the command line tools that do those conversions to build without glib. I also created my own makefile just to build the tools I use.

I've searched and I've yet to find a rhyming dictionary in stardict format. So, I'm working on creating one. It's a slow process. I've taken a public domain rhyming dictionary as a starting point and I'm in the process of editing it and converting it to the format I need.

I've also been searching for an Open Source C/C++ grammer checker, but I've yet to find one that I like.

These are just some of the projects I'm working on. If you're interested in comparing notes on these topics or if you have recommendations of other dictionary and word related projects you like, feel free to contact me ( http://www.distasis.com/contact.htm ).
It's hard to find public domain, Open Source and Creative Commons licensed language resources in formats that are easy for programs to work with. There are growing resources of scanned public domain books. Among them, you can find all kinds of dictionaries and references. Some sites even use an OCR to translate scanned documents to text formats. However, the translated versions are usually full of typographical errors.

There are a few projects out there that use Free, Open Source or Creative Commons licensing and have a goal of creating dictionaries or other references in accessible and searchable digital format, but not a lot. It would be nice to see more projects of this sort. The results could be useful with word processors and editors (such as LibreOffice, Abiword, SciTE), electronic dictionaries (such as stardict) and games (such as anagramarama and scramble).

Here are some of the projects I've located:

XDXF, the XML dictionary interchange format, project had a collection of dictionaries and language translation word lists they were working with and converting to various formats.
https://sourceforge.net/projects/xdxf/files/
They also have tools for converting between XDXF and other formats:
https://github.com/soshial/xdxf_makedict

The Moby project is a wonderful public domain resource. It includes word lists, thesaurus and more.
http://icon.shef.ac.uk/Moby/

The Free Dictionaries Project also provides downloads. If you want to translate one language to another, this is a useful, free resource:
http://www.dicts.info/uddl.php

SCOWL (Spell Checker Oriented Word Lists) and Friends has useful word lists and resources for spell checker utilities:
http://wordlist.aspell.net/

YAWL (Yet Another Word List) is based on the updated Public Domain ENABLE (Enhanced North American Benchmark Lexicon).
You can also find Libre licensed word lists in FLOSS games such as anagramarama.
http://www.sourcefiles.org/Games/Puzzle/Other_Word_Games/

Here's a rhyming dictionary (source code and online example) that uses Moby project resources to find rhymes:
http://stevehanov.ca/blog/index.php?id=8


If you know of other projects or developments in this area, I'd love to hear about them ( http://www.distasis.com/connect.htm ).",public,0,,
16253,2017-03-28 07:59:00,2017-03-28 11:59:20,"My projects with rhyming and language translation and other dictionaries, word lists and thesauri","The last post mentioned other groups' projects with dictionaries and language resources. I thought I'd mention some of the projects I've been working on in this area.

I've been creating build scripts with the LM BLD project ( http://www.distasis.com/cpp/lmbld.htm ) so that I'll have automated, repeatable steps to build programs, libraries and other types of packages. Here are some of things I've been working on.

The Moby project is a very nice dictionary resource. Using their thesaurus, I was able to create a word list and a simple dictionary in stardict format. I use it with Open Source programs like scramble.

The Strongs concordance is in the public domain. I've created a translation dictionary in stardict format with it.

I happen to like the stardict dictionary format. There are several nice programs that can work with that format. I wanted something lightweight that would work well on older systems or let me create my own GUI interfaces. The closest thing I could find to what I wanted was sdcv. However, there were a few issues I had with it. The biggest is that it requires glib as a dependency and I didn't want to install GTK+ related dependencies on my systems. The second issue I ran into was that it couldn't handle some of the newer versions of the stardict format. Since the code is GNU GPL licensed, I started with it and made several modifications and customizations. The result is sdcv2 which can be linked to my own Unicode shared libraries in place of glib if desired and can work with dictionaries in more recent stardict formats. It may not make use of all the latest features in the newer formats, but it can at least access information from them.

I've seen other projects that use the sdcv library as a back-end and create their own GUI for a dictionary program. It makes sense if the program uses GTK+, but it seems awkward for Qt or other GUI programs to require GTK+ related dependencies. With sdcv2, there are no GTK+ related dependencies.

I would love to find a dictionary with a FLTK GUI, especially if it can handle stardict format. Since, I haven't been able to find one, I may try to write one at some point. I've also been thinking about creating a pdcurses front end. When I use sdcv (or sdcv2) from the command line, certain systems like Windows can't handle input or output of certain Unicode characters correctly. I've added support for SDL 2.x, SDL2_ttf and the ability to work with a range of Unicode characters within the UCS-2 character set to pdcurses. I think pdcurses would make an interesting front end for a program using the sdcv2 library. It would work on any system that supports SDL 1.x or 2.x, including more unusual operating systems like Syllable and Haiku. Would like to hear from others who may be interested in or are working on similar projects.

The dictzip program compresses dictionary files. It uses an extension to the gzip format with extra fields to include information about the compressed dictionary. Files compressed with this format often use the .dz extension. You can use dictzip with stardict files to save space. dictzip is primarily a POSIX compliant program, so it doesn't convert well to certain systems. I was able to find a Windows port that limited the program's functionality, but did enough to get the job of compression done. I've made some modifications to it and am using it as a cross-platform method of compressing stardict dictionary files.

Several utilities and conversion programs were created for stardict in the stardict-tools project. Similar to stardict and sdcv, glib is a dependency for stardict-tools. There are a few tools that use a GTK+ front-end as well. I personally only use the stardict-tools to convert tab delimited files and files in babylon format to stardict. So, I modified the command line tools that do those conversions to build without glib. I also created my own makefile just to build the tools I use.

I've searched and I've yet to find a rhyming dictionary in stardict format. So, I'm working on creating one. It's a slow process. I've taken a public domain rhyming dictionary as a starting point and I'm in the process of editing it and converting it to the format I need.

I've also been searching for an Open Source C/C++ grammer checker, but I've yet to find one that I like.

These are just some of the projects I'm working on. If you're interested in comparing notes on these topics or if you have recommendations of other dictionary and word related projects you like, feel free to contact me ( http://www.distasis.com/contact.htm ).
This is a list of mono-spaced fonts, Unicode fonts, font resources in general and useful FLOSS font sites.

Monospaced fonts are particularly useful for programming. They can help improve readability. Some programming editors (like PFE) only work with monospaced fonts. Other editors allow the use of more varied fonts. However, monospaced fonts are useful when working with PRE tags in HTML, ASCII art and in other areas.

Other useful fonts include Unicode fonts for internationalized projects and fonts for people with handicaps or disabilities such as fonts designed for people with dyslexia.

Using a Free, Libre or Open Source font or a public domain or Creative Commons licensed font makes it easy to reuse fonts in Open Source (or even commercial) projects.


Bitmap Fonts

woafon
Fonts for DOS and Windows Command Prompts
http://www.resoo.org/docs/dos/free_software/windos.htm


Free Monospaced Fonts (Bitmap)

Dina Programming Font
http://www.donationcoder.com/Software/Jibz/Dina

efont
Electronic Font Open Laboratory
http://openlab.ring.gr.jp/efont/index.html.en

Envy Code A
http://damieng.com/creative/typography/envy-code-a

Envy Code B
http://damieng.com/creative/typography/envy-code-b

Gohufont
http://font.gohu.org/

Liberation Fonts
https://fedorahosted.org/liberation-fonts/

MonteCarlo
http://www.bok.net/MonteCarlo/

Profont
MIT license
http://tobiasjung.name/profont/

Proggy Fonts
http://www.proggyfonts.net/

Raize font
http://www.raize.com/devtools/tools/rzfont.asp

Tamsyn
http://www.fial.com/~scott/tamsyn-font/

Terminus font
Includes a large size font especially useful for terminals in framebuffer mode on Linux systems.
http://terminus-font.sourceforge.net/

Triskweline
http://www.netalive.org/tinkering/triskweline/


Free Truetype Monospaced Fonts

Adobe Fonts
https://github.com/adobe-fonts

Andika
http://software.sil.org/andika/

Bitstream Vera
https://www.gnome.org/fonts/

Crystal
Available with Povray software.
https://github.com/POV-Ray/povray/tree/master/distribution/include
http://www.povray.org/povlegal.html

Cutive Mono
https://github.com/vernnobile/CutiveFont

DejaVu Fonts
http://dejavu-fonts.org/wiki/index.php?title=Main_Page

Droid Sans Mono
http://damieng.com/blog/2007/11/14/droid-sans-mono-great-coding-font

Envy Code R
http://damieng.com/blog/2008/05/26/envy-code-r-preview-7-coding-font-released

Font Squirrel Monospaced fonts
http://www.fontsquirrel.com/fonts/list/find_fonts?filter%5Bclassification%5D=monospaced&filter%5Bfamily_size%5D=1&filter%5Bdownload%5D=all&sort=a2z

GNU FreeFont
Freemono
https://www.gnu.org/software/freefont/

GNU Unifont
http://unifoundry.com/unifont.html

Inconsolata
http://www.levien.com/type/myfonts/inconsolata.html

Junicode
Unicode font for medievalists
http://junicode.sourceforge.net/

Lucida Sans Typewriter Regular - included with Oracle's Java SDK
http://docs.oracle.com/javase/7/docs/technotes/guides/intl/font.html

Matthew Welch's MIT licensed fonts
White Rabbit worked as a monospaced font
http://www.squaregear.net/fonts/

MonteCarlo
http://www.bok.net/MonteCarlo/

Oxygen Monospace
Oxygen Fonts
https://github.com/vernnobile/oxygenFont

ParaType
PT Mono
http://www.paratype.com/public/

Roboto Mono
Google's Roboto fonts
https://github.com/google/roboto/

SIL Encore Fonts
Several unicode fonts including some monospaced ones.
http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&id=SILEncoreFonts

Source Code Pro
Adobe Font
http://adobe-fonts.github.io/source-code-pro/
https://github.com/adobe-fonts/source-code-pro

String Literal
http://www.berfont.com/fonts/14121/string_literal___variable.html

Ubuntu Fonts
Ubuntu Mono
http://font.ubuntu.com/

Xerox Alto mono-spaced font
http://damieng.com/blog/2008/05/26/envy-code-r-preview-7-coding-font-released


Other Fonts

EB Garamond
http://www.georgduffner.at/ebgaramond/index.html

e-foundry
http://www.gust.org.pl/projects/e-foundry

Lato Fonts
http://www.latofonts.com/

Libertine Open Fonts Project
http://www.linuxlibertine.org/index.php?id=1&L=1

Proza-Libre
https://github.com/jasperdewaard/Proza-Libre

Simon Tatham's Fonts Page
http://www.chiark.greenend.org.uk/~sgtatham/fonts/

Sorts Mill fonts
https://bitbucket.org/sortsmill/sortsmill-fonts


Font Related Links

Cufon Type Converter
Convert TTF fonts to cufon font format for web sites. With a few text changes, fonts can be used with Raphael.js too.
http://cufon.shoqolate.com/generate/

FontForge
Create your own fonts.
http://fontforge.github.io/en-US/

Font Library
https://fontlibrary.org/en/terms
https://fontlibrary.org/en/guidebook/existing_libre_open_fonts

Font Squirrel
Handpicked Free Fonts
http://www.fontsquirrel.com/

Google Fonts
https://fonts.google.com/

Monospace/Fixed Width Programmer's Fonts
http://lowing.org/fonts/

OpenDyslexic
http://opendyslexic.org/

SIL Open Font License (OFL)
http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&id=OFL

Wikipedia Open Source Unicode Typefaces
https://en.wikipedia.org/wiki/Open-source_Unicode_typefaces

Unicode Font Guide For Free/Libre Open Source Operating Systems
http://www.unifont.org/fontguide/

July 2017

S M T W T F S
      1
234 5678
9101112131415
16171819202122
23242526272829
3031     

Syndicate

RSS Atom

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Sep. 20th, 2017 12:39 pm
Powered by Dreamwidth Studios