Jan Daciuk
Publications
THESES
BOOKS AND BOOK CHAPTERS
- Jan Daciuk, Optimization of Automata, Gdańsk University of
Technology Publishing House, 2014.
- Jan Daciuk, Incremental Construction of Finite-State
Automata, in: Handbook
of Finite State Based Models and
Applications, Jiacun Wang (ed.), pp. 171-190, CRC Press, 2013.
- Jan Daciuk, Przetwarzanie języka naturalnego (Natural
language processing), in: Bodgan Wiszniewski (ed.), Inteligentne wydobywanie informacji z internetowych
serwisów społecznościowych (Intelligent information extraction
from internet services), Pomorskie Wydawnictwo Naukowo-Techniczne,
2011.
- Jan Daciuk, Mariusz
Szwoch, Automaty jako narzędzia w przetwarzaniu języka
(Automata as tools in natural language processing), in: Bodgan Wiszniewski (ed.), Inteligentne wydobywanie informacji z internetowych
serwisów społecznościowych (Intelligent information extraction
from internet services), Pomorskie Wydawnictwo Naukowo-Techniczne,
2011.
- Jan Daciuk, Jakub
Piskorski, Strahil
Ristov, Natural Language Dictionaries Implemented as Finite
Automata,
in: Scientific
Applications of Language Methods, Carlos Martín-Vide (ed.),
pp. 133-204, Imperial College Press, 2010.
PAPERS
- Jan Daciuk, Preserving Trees in Minimal
Automata, proceedings of FSMNLP 2015, Düsseldorf,
Germany, June 2015
- Jan Daciuk, Dawid
Weiss, Smaller representation of finite-state automata,
Theoretical Computer Science, vol. 450, pp. 10-21, 7 September 2012.
- Jan Daciuk, Dawid
Weiss, Smaller Representation of Finite State Automata,
in: Implementation
and Application of Automata. 16th International Conference, CIAA
2011, Blois, France, July 2011. Proceeedings, Beatrice
Bouchou-Markhoff, Pascal Caron, Jean-Marc Champarnaud, Denis Maurel
(eds.), LNCS 6807, Springer, 2011.
- Rafael C. Carrasco,
Jan Daciuk, Mikel
L. Forcada, Incremental
Construction of Minimal Tree
Automata, Algorithmica,
Volume
55, Number
1, pp. 95-110, September 2009.
- Rafael C. Carrasco,
Jan Daciuk, A
perfect hashing incremental scheme for unranked trees using
pseudo-minimal automata, RAIRO
- Theoretical Informatics and Applications, Volume 43,
Issue 04,
pp 779-790, DOI: http://dx.doi.org/10.1051/ita/2009018, October
2009.
- Jan Daciuk, Rafael
C. Carrasco, Perfect Hashing with Pseudo-minimal Bottom-up
Deterministic Tree Automata, Intelligent Information Systems XVI,
Proceedings of the International IIS'08 Conference held in
Zakopane, Poland, June 16-18, 2008, Mieczysław A. Kłopotek, Adam
Przepiórkowski, Sławomir T. Wierzchoń, Krzysztof Trojanowski
(eds.), Academic Publishing House Exit, Warszawa 2008.
- Jan Daciuk, Perfect Hashing Tree Automata, FSMNLP 2007,
Potsdam, Germany, 2007.
- Rafael C. Carrasco,
Jan Daciuk, Mikel
L. Forcada, An
Implementation of Deterministic Tree Automata Minimization,
Implementation and Application of Automata. 12th International
Conference, CIAA 2007, Prague, Czech Republic, July 2007, Revised
Selected Papers, Jan Holub, Jan Zdarek (eds.), LNCS 4783, Springer
2007.
- Jan Daciuk, Jakub Piskorski, Gazetteer compression technique based
on substructure recognition, In: Intelligent information
processing and web mining, Proceedings of the International
IIS:IIPWM'06 Conference held in Ustroń, Poland, June
19-22, 2006, pp. 87-95, Springer Verlag, Advances in Soft Computing,
2006.
- Denis
Maurel, Jan Daciuk, Les transducteurs a sorties variables,
Verbum ex machina, Actes
de la 13eme conference annuelle
sur le traitement des langues naturelles, Piet Mertens, Cedrick
Fairon, Anne Dister, Patrick Watrin (eds.), pp. 237-245, volume 1,
UCL Presses Universitaires de Louvain, Louvain, Belgium, 10-13 April
2006.
- Jan Daciuk,
Denis
Maurel, Agata
Savary, Incremental and Semi-incremental Construction of
Pseudo-Minimal Automata, Implementation and Application of
Automata: 10th
International Conference, CIAA 2005, Sophia Antipolis, France, June
27-29, 2005, Revised Selected Papers, pp. 341-342, Jacques Farre,
Igor Litovsky, Sylvain Schmitz, LNCS Vol. 3845/2006, Springer 2006.
- Jan Daciuk,
Denis
Maurel, Agata
Savary, Dynamic Perfect
Hashing with Finite-State
Automata, in Intelligent Information Processing and Web Mining,
Proceedings of the International
IIS: IIPWM'05 Conference held in Gdańsk, Poland, June 13-16, 2005,
Mieczysław A. Kłopotek, Sławomir Wierzchoń, Krzysztof Trojanowski
(eds.), Advances in Soft Computing, Springer, 2005.
- Jan Daciuk, Extension of Selected ADFA Construction Algorithms to
the Case of Cyclic Automata, Eindhoven FASTAR Days 2004, Eindhoven, the Netherlands, September 2004.
- Jan Daciuk, Comments
on Incremental Construction and Maintenance of Minimal
Finite-State Automata, by Rafael C. Carrasco and Mikel
L. Forcada, Computational
Linguistics, 30(2), pp. 227-235, June 2004.
- Jan Daciuk, Semi-incremental Addition of Strings to a Cyclic
Finite Automaton, Intelligent Information Processing and Web
Mining, Proceedings of the International IIS: IIP WM'04
Conference held in Zakopane, Poland, May 17-20, 2004. Advances in Soft
Computing, pp. 201-207, Mieczysław A. Kłopotek, Sławomir
T. Wierzchoń, Krzysztof Trojanowski (eds.), Springer-Verlag, 2004.
- Jan Daciuk, Finite-State Lexical Tools, BIS 2004, 7th International
Conference on Business Information Systems, pp. 373-380,
Witold Abramowicz
(ed.), Wydawnictwo Akademii Ekonomicznej w Poznaniu, Poznań, Poland,
21-23 April, 2004.
- Jan Daciuk, Gertjan
van Noord, Finite
automata for compact representation of tuple dictionaries, Theoretical
Computer Science, 313(1), 16 Feb., 2004
- Bruce W. Watson, Jan Daciuk, An
efficient incremental DFA minimization algorithm, Natural
Language Engineering, 9(1), pp. 49-64, Cambridge University
Press, March 2003.
- Jan Daciuk, Semi-Incremental Construction of Minimal Cyclic
Finite-State Automata Using Continuation Classes, Proceeding of
the Workshop on
Finite-State Methods in Natural Language Processing at the 10th Conference of the
European Chapter of the Association for Computational
Linguistics, Budapest, Hungary, April 13th-14th, 2003.
- Jan Daciuk, Comparison of Construction Algorithms for Minimal,
Acyclic, Deterministic, Finite-State Automata from Sets of
Strings, Seventh
International Conference on Implementation and Application of
Automata CIAA '2002, Tours, France, 2002. (C)
Springer Verlag LNCS Series
2608, pp. 255-261, January 2003.
- Jan Daciuk, Computer-Assisted
Enlargement of Morphological Dictionaries,
Finite
State Methods in Natural Language Processing, Workshop at 13th European Summer School in
Logic, Language and Information, Helsinki, Finland, 13-24 August, 2001.
- Jan Daciuk, Gertjan
van Noord, Finite Automata for Compact Representation of
Language Models in NLP, Sixth International Conference on
Implementation and Application of Automata, CIAA'2001, Pretoria,
South Africa, July 23-25, 2001. (C) Springer Verlag LNCS Series 2494,
pp. 65-73,
2003.
- Jan Daciuk, Finite State Tools for
Natural Language Processing,
proceedings of the
COLING 2000 workshop Using
Toolsets and Architectures to Build NLP Systems, pp. 34-37, Centre
Universitaire, Luxembourg, August 5th, 2000.
- Jan Daciuk, Experiments with Automata
Compression, proceedings of Conference on Implementation
and Application of Automata CIAA'2000, London, Ontario, Canada,
July 24-26, 2000, (C) Springer Verlag
LNCS Series 2088,
pp. 105-112, 2001.
- Jan Daciuk, Stoyan Mihov, Bruce
Watson, and Richard Watson,
Incremental Construction of Minimal
Acyclic Finite State Automata, Computational
Linguistics, 26(1), pp. 3-16,
March 2000.
- Jan Daciuk, Treatment
of Unknown Words, proceedings of Workshop on Implementing
Automata WIA'99, Potsdam, Germany, 1999, (C) Springer Verlag LNCS Series Volume
2214, pp. 71-80, 2001.
- Jan Daciuk, A Module for Treatment of Unknown Words,
Speech and Language Technology, Wiktor Jassem, Czesław Basztura,
Grażyna Demenko and Krzysztof Jassem (eds.), Volume 3, pp. 165-169,
Poznań, Poland, 1999.
- Jan Daciuk, Bruce Watson, and
Richard Watson, Incremental Construction of
Minimal Acyclic Finite State Automata and Transducers, proceedings of
Finite State
Methods in Natural Language Processing, pp. 48-56, Bilkent University,
Ankara, Turkey, June 29 - July 1, 1998.
- Jan Daciuk, Finite-State Automata in Document Retrieval,
Intelligent
Information Systems VII, Proceedings of the Workshop held in
Malbork, Poland, June 15-19, 1998.
- Jan Daciuk, Dopisywanie polskich znaków diakrytycznych za pomocą
automatów (Restoring Polish diacritics using automata), Krajowe Sympozjum
Telekomunikacji (State Symposium of Telecommunication), Bydgoszcz,
1997.
- Jan Daciuk, Automaty jedno- i dwupoziomowe w przetwarzaniu języka
naturalnego (Automata and Transducers in Natural Language Processing),
proceedings of Informatyka na Wyższych
Uczelniach dla Gospodarki Narodowej (Computer Science at Universities
for National Economy), Gdańsk, 1996.
- Jan Daciuk, Wspomagana komputerowo korekta tekstów
pisanych w języku polskim (Computer Aided Correction of Texts Written
in Polish), Seminarium Język i Technologia 1995 (Language and
Technology Seminar 1995), Poznań, 1995.
- Jan Daciuk, Zastosowania przetwarzania języka naturalnego
(Natural Language Processing Applications), proceedings of
Informatyka
na Wyższych Uczelniach dla Gospodarki Narodowej
(Computer Science at Universities for National Economy),
Gdańsk, 1994.
- Jan Daciuk, Andrzej Jędruch, Statystyczna metoda polonizacji tekstów
(Statistical Method of Restoring Diacritics in Texts in Polish),
Krajowe Sympozjum Telekomunikacji (State Symposium of Telecommunication),
Bydgoszcz, 1994.
- Andrzej Guziński,
Andrzej Borzyszkowski,
Jan Daciuk,
and Maciej Guziński,
Application of the CITIUS System to Teaching Electronic Circuits,
"III Seminarium Zastosowanie komputerów w dydaktyce '93"
(3rd Seminar "Application of Computers in Didactics '93",
Zeszyty Naukowe Wydziału Elektrycznego Politechniki Gdańskiej
(Scientific Papers of the Electric Faculty of the Technical University
of Gdańsk) No 3, Gdańsk, 1993.
- Andrzej Guziński
, Andrzej Borzyszkowski
, Jan Daciuk, and Maciej Guziński,
CITIUS CAT System. Part II: Lecture Presentation Program,
Fourth CITIUS Workshop, Gdańsk, 1993.
- Andrzej Guziński,
Andrzej Borzyszkowski
, Jan Daciuk, and Maciej Guziński,
CITIUS CAT System. Part I; Lecture Preparation System, Third
Citius Workshop, Mulhouse, 1992.
- Jan Daciuk, Andrzej Borzyszkowski
, and Andrzej Guziński,
and Maciej Guziński
, CITIUS Computer Aided Teaching System, Second CITIUS
Workshop, Gdańsk, 1992.
- Andrzej Guziński
, Jan Daciuk, and
Maciej Guziński, CAT of Electronic Circuits,
First CITIUS Workshop, Strasbourg - Karlsruhe, 1991.
UNPUBLISHED PAPERS
- Jan Daciuk, Gertjan
van Noord, A Finite-State Library for
NLP, presented at CLIN 2001 (Twelfth Meeting of Computational Linguistics in the
Netherlands), University of Twente, Enschede, 30 November 2001.
- Jan Daciuk, Gramatyki połączeniowe w korekcie polskich tekstów
(Link Grammars in Correction of Texts in Polish), technical report,
Technical University of Gdańsk, 1993.
- Jan Daciuk, Wymagania dla słownika języka polskiego do sprawdzania
poprawności tekstów w języku polskim
(Requirements for a Polish Language Lexicon for Correction of Texts in
Polish),
technical report, Technical University of Gdańsk, 1992.
INVITED TALKS
- "Incremental Construction of Minimal
Deterministic Tree Automata". Presented at the Seminar für
Sprachwissenschaft, Tübingen University, Germany, May 19th,
2006.
- "Extension of Selected ADFA Construction
Algorithms to the Case of Cyclic Automata". Presented at a
Seminar in IUT Blois, Universite de Tours, Blois, France, November 25th,
2004.
- "Extension of Selected ADFA Construction
Algorithms to the Case of Cyclic Automata". Presented at a
Seminar in Departement d'Informatique, Ecole Polytechnique de l'Université
de Tours, Tours, France, November 17st, 2004.
- "Guessing Automata". Presented at a seminar of a research group Langues et
Representations, Université de Tours, Tours, France, November 2nd,
2004.
- "Incremental Construction of Minimal
Finite State Automata". Presented at Université de Tours,
Tours, France, October 21st, 2004.
- "Extension of Selected ADFA Construction
Algorithms to the Case of Cyclic Automata". Presented at a
Seminar in Departament de Llenguathes i Sistems Informatics,
Universitat d'Alacant, Alacant/Alicante, Spain, September 20th, 2004.
- "Computer-Aided Enlargment of Morphological
Dictionaries". Presented at the Natural Language Processing
Seminar, The Linguistic Engineering / Formal Linguistics Group,
Linguistic Engineering Group at the Department of
Artificial Intelligence, Institute of Computer Science, Polish Academy
of Sciences, Warsaw, Poland, June 25th, 2001.
- "Incremental Construction of Minimal, Deterministic, Acyclic,
Finite-State Automata". Presented at the Seminar für
Sprachwissenschaft, Tübingen University, Germany, May 24th,
2000.
- "Construction of Guessing Automata for Morphological Analysis and
Morphological Descriptions". Presented at the Seminar für
Sprachwissenschaft, Tübingen University, Germany, May 22nd,
2000.
- "Incremental Construction of Finite-State Automata". Presented at
the Mathematical Faculty of the University of Gdańsk, Gdańsk, Poland,
1999.
- "Link Grammars". Presented at the Mathematical Faculty of the
University of Gdańsk, Gdańsk, Poland, 1993.
REPORTS AND PROPOSALS
SOFTWARE AND DOCUMENTATION
- adfa: a program for testing various
acyclic automata construction methods, Rijksuniversiteit
Groningen, the Netherlands, 2002.
- A set of programs for testing automata
minimization algorithms, and in particular the new (my)
version of the incremental algorithm by Bruce Watson,
Rijksuniversiteit Groningen, the Netherlands, 2002.
- Fadd: a library for accessing dictionaries in form of
finite-state automata, finite-state perfect hashing functions, and
compressed finite-state language models, Rijksuniversiteit
Groningen, the Netherlands, 2001. Based on the fsa package.
- Fsa: A package of programs for construction and use of
finite-state automata for morphological analysis, spelling correction,
restoration of diacritics, and perfect hashing. Written at ISSCO,
University of Geneva, Switzerland, 1995-96, maintained at the Technical
University of Gdańsk, Poland.
- Utr: A package for construction and use of finite-state
transducers for morphological analysis and synthesis, spelling
correction, and restoration of diacritics. Written at ISSCO,
University of Geneva, Switzerland, 1995-96, maintained at the Technical
University of Gdańsk, Poland.
- Opis procedur dostępu do ekranu komputera Jantar
(Description of Procedures that Access the Screen of Jantar
Computer), internal publication of the Technical University of
Gdańsk, 1991.
Jan Daciuk
email: jjaannddaacc@eti.pg.gda.pl (remove the stuttering!)