Skip to main content

On Problems of Automatic Legal Texts Processing and Information Acquiring from Normative Acts

  • Chapter
Advances in Business ICT

Abstract

In the paper, problems of legal information digitalization are investigated. Conditions for extraction information from legal texts (i.a. normative acts) related to the common ones processing (non-legal terms, in English) are outlined. Problems of dimensionality reduction and application of similarity measures are discussed. Sample results of similarity analysis is presented. Further research aimed at semantic analysis of legal texts are outlined.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Anderson, E., Bai, Z., Bischof, C., Blackford, S., Demmel, J., Dongarra, J., Croz, J., Du, G.A., Hammarling, S., McKenney, A., Sorensen, D.: LAPACK User’s Guide, 3rd edn. SIAM, Philadelphia (1999), http://www.netlib.org/lapack/lug/lapack_lug.html

    Book  MATH  Google Scholar 

  2. Broda, B., Piasecki, M.: SuperMatrix: a General Tool for Lexical Semantic Knowledge Acquisition. In: Speech and Language Technology, pp. 239–254. Polish Phonetics Assocation (2008)

    Google Scholar 

  3. Budanitsky, A., Hirst, G.: Semantic Distance in WordNet: An Experimental, Application-Oriented Evaluation of Five Measures. In: Proc. of the Workshop WordNet and Other Lexical Resources, Second Meeting of the North Am. Chapter of the Association for Computational Linguistics (2001)

    Google Scholar 

  4. Charikar, M.S.: Similarity Estimation Techniques from Rounding Algorithms. In: Reif, J. (ed.) STOC 2002 Proceedings of the Thirty-fourth Annual ACM Symposium on Theory of Computing. ACM (2002)

    Google Scholar 

  5. Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of the American Society of Information Science 41(6), 391–407 (1990)

    Article  Google Scholar 

  6. Deerwester, S., et al.: Improving Information Retriev-al with Latent Semantic Indexing. In: Proc. of the 51st Annual Meeting of the American Society for Information Science, vol. 25, pp. 36–40 (1988)

    Google Scholar 

  7. Hand, D., Mannila, H., Smyth, P.: Principles of Data Mining. MIT Press (2001)

    Google Scholar 

  8. Jolliffe, I.T.: Principal Component Analysis, 2nd edn. Springer, New York (2002)

    MATH  Google Scholar 

  9. Kamvar, S.D., Haveliwala, T.H., Manning, C.D., Golub, G.H.: Extrapolation methods for accelerating PageRank computations. In: Proc. of the WWW 2003 Proc. of the 12th International Conference on World Wide Web, pp. 261–270. ACM, New York (2003)

    Google Scholar 

  10. Karhunen, K.: Zur Spektraltheorie Stochastischer Prozesse. Annales Academiae Scientiarum Fennicae, Series A1, Mathematica-Physica 34, 1–7 (1946)

    Google Scholar 

  11. Kisz, A.: Model cybernetyczny powstawania i działania prawa, Wrocław (1970)

    Google Scholar 

  12. Landauer, T., Dumais, S.: A solution to Plato’s problem: The latent semantic analysis theory of acquisition. Psychological Review 104(2), 211–240 (1997)

    Article  Google Scholar 

  13. Li, Y.H., Bandar, Z., McLean, D.: An Approach for Measuring Semantic Similarity Using Multiple Information Sources. IEEE Trans. Knowledge and Data Eng. 15(4), 871–882 (2003)

    Article  Google Scholar 

  14. Li, Y., McLean, D., Bandar, Z.A., O’Shea, J.D., Crockett, K.: Sentence Similarity Based on Semantic Nets and Corpus Statistics. IEEE Transaction of Knowledge and Data Engineering 18(8), 1138–1150 (2006)

    Article  Google Scholar 

  15. Loève, M.M.: Probability Theory. VanNostrand, Princeton (1955)

    MATH  Google Scholar 

  16. Malinowski, A.: Wstęp do badań cybernetycznych w prawoznawstwie, Warszawa (1977)

    Google Scholar 

  17. Markines, B., Cattuto, C., Menczer, F., Benz, D., Hotho, A., Stumme, G.: Evaluating Similarity Measures for Emergent Semantics of Social Tagging. In: WWW 2009 Proceedings of the 18th International Conference on World Wide Web. ACM (2009)

    Google Scholar 

  18. Maziarz, M., Piasecki, M., Szpakowicz, S.: Approaching plWordNet 2.0. In: Proc. of the 6th Global Wordnet Conference, Matsue, Japan, January 9-13 (2012) (accepted for publishing)

    Google Scholar 

  19. Palmirani, M., Cervone, L., Vitali, F.: A Legal Document Ontology: The Missing Layer in Legal Document Modeling. In: Sartor, G. (ed.) Approaches to Legal Ontologies. Thwoeiwa, Domains, Methodologies. Springer, Dordrecht (2011)

    Google Scholar 

  20. Potiopa, P.: Methods and tools for the automatic processing of textual information and its use in the process of knowledge management. Automatyka 15(2), 409–419 (2011)

    Google Scholar 

  21. Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill (1983)

    Google Scholar 

  22. Sartor, G.: Introduction: ICT and Legislation in the Knowledge Society. In: Sartor, G., Palmirani, M., Francesconi, E., Biasiotti, M.A. (eds.) Legislative XML for Semantic Web, p. 23. Springer, Dordrecht (2012)

    Google Scholar 

  23. Sobczak, K.: Prawo a informatyka. Wydawnictwo Prawnicze, Warszawa (1978)

    Google Scholar 

  24. Stewart, G.W.: On the Early History of the Singular Value Decomposition. SIAM Review 35(4), 551–566 (1993)

    Article  MathSciNet  MATH  Google Scholar 

  25. Studnicki, F.: Cybernetyka a prawo, Warszawa (1969)

    Google Scholar 

  26. Turtle, H.: Text Retrieval in the Legal World. Artificial Intelligence and Law (3) (1995)

    Google Scholar 

  27. Vitali, F.: A Standard-Based Approach for the Man-agement of Legislative Documents. In: Legislative XML for the Semantic Web, Law, Governance and Technology Series, vol. 4, p. 44 (2011)

    Google Scholar 

  28. Wiewiórowski, W.R., Wierczyński, G.: Informatyka prawnicza. Technologia informacyjna dla prawników i administracji publicznej. Oficyna – Wolters Kluwer business, Warszawa (2008)

    Google Scholar 

  29. Yang, K., Shahabi, C.: A PCA-Based Similarity Measure for Multivariate Time Series. In: ACM International Workshop on Multimedia Databases (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tomasz Pełech-Pilichowski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Pełech-Pilichowski, T., Cyrul, W., Potiopa, P. (2014). On Problems of Automatic Legal Texts Processing and Information Acquiring from Normative Acts. In: Mach-Król, M., Pełech-Pilichowski, T. (eds) Advances in Business ICT. Advances in Intelligent Systems and Computing, vol 257. Springer, Cham. https://doi.org/10.1007/978-3-319-03677-9_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-03677-9_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-03676-2

  • Online ISBN: 978-3-319-03677-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics