Reinard van Dalen

Groningen, Groningen, Netherlands
454 followers 447 connections

About

Marketing and communication - Education - IT - Business - Entrepreneur - Governance - Politics - Skiing

Experience

Education

  •  Graphic

    -

    -

    Activities and Societies: Letteren Vooruit

  • -

    -

    Activities and Societies: Academisch Studenten Comité Informatiekunde (ASCI), Ganymedes, Letteren Vooruit, Student Ambassador, and Lijst STERK.

  • -

    -

    Activities and Societies: Student council, school newspaper

Licenses & Certifications

Publications

  • Hatching Chick at SemEval-2018 Task 2: Multilingual Emoji Prediction

    The International Workshop on Semantic Evaluation: Proceedings of the Twelfth Workshop

    As part of a SemEval 2018 shared task, an attempt was made to build a system capable of predicting the occurrence of a language’s most frequently used emoji in tweets. Specifically, models for English and Spanish data were created and trained on 500,000 and 100,000 tweets respectively. To create these models, a logistic regressor, a sequential LSTM, a random forest regressor and an SVM were first tested. The last of these performed best and was therefore optimized individually for both languages. During development, F1-scores of 61 and 82 were obtained for English and Spanish data respectively; in comparison, F1-scores on the official evaluation data were 21 and 18. The significant decrease in performance during evaluation might be explained by overfitting during development and might therefore have been partially prevented by using cross-validation. Overall, emoji that occur in a very specific context, such as a Christmas tree, were found to be most predictable.

  • Profiling Dutch Authors on Twitter: Discovering Political Preference and Income Level

    Computational Linguistics in the Netherlands Journal

    Research in author profiling has primarily focused on English-speaking users and attributes like age, gender and occupation. We present first experiments on automatically profiling Dutch Twitter users for two less-studied attributes, namely their political preference and income level (low vs. high). We create two novel corpora using distant supervision, evaluate the corpus creation approach, and train predictive models for each attribute. Our empirical evaluation shows that distant supervision is surprisingly reliable and that the political preference and income level of Dutch users can be predicted relatively accurately from the linguistic input. We also discuss which features are predictive of income and political preference, respectively.


Languages

  • Dutch

    Native or bilingual proficiency

  • English

    -

  • German

    -
