-
Notifications
You must be signed in to change notification settings - Fork 51
To Do list
Sebastian Humenda edited this page Aug 30, 2016
·
6 revisions
An informal and messy list of things to do. May be used for quick notes before they are ripe enough to be entered into the project issue page.
- MFAQ: add the non-server method of access (GoldenDict)
- check whether
[[_TOC_]]
is supported in GitHub wikis and use them for the HOWTO chapters - update Chapter 4
- how about upgrading Michael's freedict-editor to a simple form of TEI-P5?
- Beata and Piotr use FLEx for the non-FreeDict Swahili-Polish dictionary; Piotr plans to create a LIFT-TEI roundtripping system that could be used for FreeDict as well
- introduce changes for OLAC
- restrict the ODD further (hmm, create a higher level ODD, that is... enumerate @type for cit ("dicteg", "trans"?), types of notes(?), etc.)
- think of setting up an ontology for grammatical stuff, either GOLD or (perhaps easier?) ISOCat (see a proposal)
- look at other dictionary formats
- StarDict people seemed not interested in cooperation
- SimiDic: hard to get contact to developers
- consider the switch to XSLT 2.0 for the style sheets
- 2016: no full and free open-source implementation available
- implement symlinks to schemas and such at last (?) -- but some software (oXygen) doesn't understand symlinks (post in oXygen forums); this is important, because otherwise we'll have to update stuff by hand from shared/
- post at least some of this list to the issue tracker...
- don't forget about the idea of getting rid of the "Maintainer" field from dictionary headers
- modify gender symbols to be more readable than e.g.
<n m>
(<n masc>
seems better), e.g. in Khasi-English
notes written during the switch to TEI-P5
- introduce the header plug-in to the ontology interface
- there are still brackets used in some of them (kha-deu, but also others)
- Ergane dictionaries:
- some multi-word entry words are trimmed -- make sure to fix that, too
- some equivalents are repeated
- some English pronunciation info is suboptimal or just wrong; stress marks are missing, too; possibly, use the English-Polish dictionary as the template, or go for something else that is standard.
- the distribution of equivalents across senses or just cits within a single sense may sometimes be odd (this is a general lexicographic issue)
- possible task: sensible reversal of an existing monodirectional dictionary
- possible task: concatenation of existing dictionaries; it would be good to have a system that may allow for the product of such concatenation to be updated
- necessary task: go over the FreeDict build system, making it absolutely transparent and as modular as possible
- sensible task: use a good dictionary for the basis of pronunciation information (e.g. the English-Polish dictionary is a good beginning for English pronunciation) and project that to all other dictionaries with the same source language
- another sensible task: use the info on equivalent grammatical properties (e.g. gender in *-deu or *-fra dictionaries) and project that to all the others
- style sheets do not split multiple orth's into multiple dict entries (and hence users can't search for them)
These came up in the process of introducing references to externally defined data categories.
- redundancy between
<pos>n</pos>
and@ana="\#n"
-- should we address it somehow?- it doesn't seem like it can be addressed at tagUsage level
- how to ensure equivalence between
<pos>vt</pos>
and<pos>v</pos><subc>trans</subc>
(neither repository has "transitivity" as a general category (GOLD's transitivity is different)) - which of the three "masculine"s (etc.) to choose from ISOCat? (weird)
- ensure that the masc gender in entry/gramGrp (and rel/gramGrp) need not be the same masc as that used in cit/gramGrp -- because they may very well realise different networks of oppositions.