-
Notifications
You must be signed in to change notification settings - Fork 6
Expand file tree
/
Copy pathlectures.html
More file actions
1187 lines (1038 loc) · 92.5 KB
/
lectures.html
File metadata and controls
1187 lines (1038 loc) · 92.5 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE HTML>
<!--
Imagination by TEMPLATED
templated.co @templatedco
Released for free under the Creative Commons Attribution 3.0 license (templated.co/license)
-->
<html>
<head>
<title>SIGTYP -- Lecture Series</title>
<link rel="shortcut icon" type="image/x-icon" href="images/favicon.ico" />
<meta http-equiv="content-type" content="text/html; charset=utf-8" />
<meta name="description" content="" />
<meta name="keywords" content="sigtyp,sigtyp lectures,naacl,workshop,typology,linguistics,multilinguality" />
<link href='http://fonts.googleapis.com/css?family=Raleway:400,100,200,300,500,600,700,800,900' rel='stylesheet' type='text/css'>
<!--[if lte IE 8]><script src="js/html5shiv.js"></script><![endif]-->
<script src="https://ajax.googleapis.com/ajax/libs/jquery/1.11.0/jquery.min.js"></script>
<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.4.1/css/bootstrap.min.css">
<!--script src="https://ajax.googleapis.com/ajax/libs/jquery/3.4.1/jquery.min.js"></script-->
<script src="https://maxcdn.bootstrapcdn.com/bootstrap/3.4.1/js/bootstrap.min.js"></script>
<script src="js/skel.min.js"></script>
<script src="js/skel-panels.min.js"></script>
<script src="js/init.js"></script>
<noscript>
<link rel="stylesheet" href="css/skel-noscript.css" />
<link rel="stylesheet" href="css/style.css" />
<link rel="stylesheet" href="css/style-desktop.css" />
</noscript>
<!--Q/A display-->
<script>
function resizeIframe(obj) {
obj.style.height = obj.contentWindow.document.documentElement.scrollHeight + 'px';
obj.style.height = '20px';
}
</script>
<style>
* {box-sizing: border-box}
.mySlides {display: none}
img {vertical-align: middle;}
/* Slideshow container */
.slideshow-container {
max-width: 1000px;
position: relative;
margin: auto;
}
/* Next & previous buttons */
.prev1, .next1 {
cursor: pointer;
position: absolute;
top: 50%;
width: 5%;
padding: 16px;
margin-top: -22px;
color: white;
font-weight: bold;
font-size: 18px;
transition: 0.6s ease;
border-radius: 0 3px 3px 0;
user-select: none;
}
/* Position the "next button" to the right */
.next1 {
right: 0;
border-radius: 3px 0 0 3px;
}
/* Position the "prev button" to the left */
.prev1 {
left: 0;
border-radius: 3px 0 0 3px;
}
/* On hover, add a black background color with a little bit see-through */
.prev1:hover, .next1:hover {
background-color: rgba(0,0,0,0.8);
}
/* Caption text */
.text1 {
color: inherit;
font-size: inherit;
padding: -30px 10px;
position: absolute;
bottom: -95px;
width: 100%;
text-align: center;
}
/* Number text (1/3 etc) */
.numbertext {
color: #f2f2f2;
font-size: 12px;
padding: 8px 12px;
position: absolute;
top: 0;
}
/* The dots/bullets/indicators */
.dot {
cursor: pointer;
height: 15px;
width: 15px;
margin: 0 2px;
background-color: #bbb;
border-radius: 50%;
display: inline-block;
transition: background-color 0.6s ease;
}
.active1, .dot:hover {
background-color: #717171;
}
/* Fading animation */
.fade {
-webkit-animation-name: fade;
-webkit-animation-duration: 1.5s;
animation-name: fade;
animation-duration: 1.5s;
}
@-webkit-keyframes fade {
from {opacity: .4}
to {opacity: 1}
}
@keyframes fade {
from {opacity: .4}
to {opacity: 1}
}
/* On smaller screens, decrease text size */
@media only screen and (max-width: 300px) {
.prev1, .next1,.text1 {font-size: 11px}
}
</style>
<style>
* {box-sizing: border-box}
.mySlides {display: none}
img {vertical-align: middle;}
/* Slideshow container */
.slideshow-container {
max-width: 1000px;
position: relative;
margin: auto;
}
/* Next & previous buttons */
.prev1, .next1 {
cursor: pointer;
position: absolute;
top: 50%;
width: 5%;
padding: 16px;
margin-top: -22px;
color: white;
font-weight: bold;
font-size: 18px;
transition: 0.6s ease;
border-radius: 0 3px 3px 0;
user-select: none;
}
/* Position the "next button" to the right */
.next1 {
right: 0;
border-radius: 3px 0 0 3px;
}
/* Position the "prev button" to the left */
.prev1 {
left: 0;
border-radius: 3px 0 0 3px;
}
/* On hover, add a black background color with a little bit see-through */
.prev1:hover, .next1:hover {
background-color: rgba(0,0,0,0.8);
}
/* Caption text */
.text1 {
color: inherit;
font-size: inherit;
padding: -30px 10px;
position: absolute;
bottom: -95px;
width: 100%;
text-align: center;
}
/* Number text (1/3 etc) */
.numbertext {
color: #f2f2f2;
font-size: 12px;
padding: 8px 12px;
position: absolute;
top: 0;
}
/* The dots/bullets/indicators */
.dot {
cursor: pointer;
height: 15px;
width: 15px;
margin: 0 2px;
background-color: #bbb;
border-radius: 50%;
display: inline-block;
transition: background-color 0.6s ease;
}
.active1, .dot:hover {
background-color: #717171;
}
/* Fading animation */
.fade {
-webkit-animation-name: fade;
-webkit-animation-duration: 1.5s;
animation-name: fade;
animation-duration: 1.5s;
}
@-webkit-keyframes fade {
from {opacity: .4}
to {opacity: 1}
}
@keyframes fade {
from {opacity: .4}
to {opacity: 1}
}
/* On smaller screens, decrease text size */
@media only screen and (max-width: 300px) {
.prev1, .next1,.text1 {font-size: 11px}
}
</style>
</style>
</head>
<body>
<div id="header-wrapper">
<!-- Header -->
<div id="header">
<div class="container">
<!-- Logo -->
<div id="logo">
<a href="index.html"><img src="images/sigtyp1.jpg" style="width:70px" alt="SIGTYP"></a>
<!--h1><a href="#"><font color=red>S</font>IG<font color=red>T</font>YP</a></h1-->
</div>
<!-- Nav -->
<nav id="nav">
<ul>
<li><a href="index.html">Homepage</a></li>
<li><a href="constitution.html">Constitution</a></li>
<li><a href="members.html">Members</a></li>
<li><a href="workshop.html">Workshop</a></li>
<li class="active"><a href="lectures.html">Lectures</a></li>
<li><a href="blog.html">Blog</a></li>
</ul>
</nav>
</div>
</div>
<!-- Header -->
<!-- Banner -->
<div id="banner">
<div class="container">
<section>
<!--span class="fa fa-cubes"></span-->
<header>
<h2 > </h2>
<h2 > </h2>
<h2 > </h2>
<span class="byline"> </span>
</header>
<!--a href="#" class="button medium">Fusce ultrices fringilla</a-->
</section>
</div>
</div>
<!-- /Banner -->
</div>
<!-- Main -->
<div id="main">
<div class="container">
<section>
<header>
<h2> <font color="red">S</font><font color="yellow">I</font><font color="green">G</font><font color="brown">T</font><font color="blue">Y</font><font color="purple">P</font> LECTURE SERIES</h2> </header>
<hr style="border-top: dotted 2px;"/>
<header>
<h2 style="font-size:28px">Next Event</h2>
</header>
<br/>
<div class="slideshow-container">
<!-- Full-width images with number and caption text -->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-jon-rawski-tickets-155525680497" class="image full"><img src="lectures/speakers/jon-rawski.png" style="width:100%"></a>
<div class="text1"><a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-jon-rawski-tickets-155525680497">Typology Emerges from Computability</a></div>
</div>
<div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-tiago-pimentel-tickets-158698843513" class="image full"><img src="lectures/speakers/tiago-pimentel.jpeg" style="width:100%"></a>
<div class="text1"><a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-tiago-pimentel-tickets-158698843513">An Informative Exploration of the Lexicon</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-maria-ryskina-tickets-158694697111" class="image full"><img src="lectures/speakers/maria-ryskina.png" style="width:100%"></a>
<div class="text1"><a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-maria-ryskina-tickets-158694697111">Informal Romanization Across Languages and Scripts</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-shruti-rijhwani-tickets-159198706617" class="image full"><img src="lectures/speakers/shruti-rijhwani.png" style="width:100%"></a>
<div class="text1"><a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-shruti-rijhwani-tickets-159198706617">Cross-Lingual Entity Linking for Low-Resource Languages</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-david-inman-tickets-156546130691" class="image full"><img src="lectures/speakers/david-inman.png" style="width:100%"></a>
<div class="text1"><a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-david-inman-tickets-156546130691">Conceptual Interdependence in Language Description, Typology, and NLP: Examples from Nuuchahnulth</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-tuhin-chakrabarty-tickets-161252072289" class="image full"><img src="lectures/speakers/tuhin_chakrabarty.png" style="width:100%"></a>
<div class="text1"><a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-tuhin-chakrabarty-tickets-161252072289">Neuro-Symbolic methods for creative text generation</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-sabrina-j-mielke-tickets-156592866479" class="image full"><img src="lectures/speakers/sabrina-mielke.png" style="width:100%"></a>
<div class="text1"><a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-sabrina-j-mielke-tickets-156592866479">Fair Comparisons for Generative Language Models -- with a bit of Information Theory</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-richard-futrell-tickets-161249853653" class="image full"><img src="lectures/speakers/richard_futrell.png" style="width:100%"></a>
<div class="text1"<a href="https://www.eventbrite.co.uk/e/sigtyp-lecture-series-richard-futrell-tickets-161249853653">Investigating Information-Theoretic Influences on the Order of Elements in Natural Language</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://eleanorchodroff.com/" class="image full"><img src="lectures/speakers/eleanor-chodroff.png" style="width:100%"></a>
<div class="text1"><a href="https://eleanorchodroff.com/">Structure in Cross-linguistic Phonetic Realization</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://scholar.google.co.il/citations?user=Aaj_RBEAAAAJ&hl=en"" class="image full"><img src="lectures/speakers/amit_moryossef.png" style="width:100%"></a>
<div class="text1"><a href="https://scholar.google.co.il/citations?user=Aaj_RBEAAAAJ&hl=en"">Including Signed Languages in Natural Language Processing</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://d-ataman.github.io/" class="image full"><img src="lectures/speakers/duygu_ataman.png" style="width:100%"></a>
<div class="text1"><a href="https://d-ataman.github.io/">Machine Translation of Morphologically-Rich Languages: a Survey and Open Challenges</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="http://kat.academy" class="image full"><img src="lectures/speakers/ekaterina_vylomova.png" style="width:100%"></a>
<div class="text1"><a href="http://kat.academy">UniMorph and Morphological Inflection Task: Past, Present, and Future</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/adina_williams.png" style="width:100%"></a>
<div class="text1"><a href="https://TBA">TBA</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/kyle_gorman.png" style="width:100%"></a>
<div class="text1"><a href="http://wellformedness.com/">On "Massively Multilingual" Natural Language Processing</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/kyle_mahowald.png" style="width:100%"></a>
<div class="text1"><a href="https://mahowak.github.io/">“Deep” Subjecthood: Classifying Grammatical Subjects and Objects across Languages</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/kayo_yin.png" style="width:100%"></a>
<div class="text1"><a href="https://kayoyin.github.io"> Understanding, Improving and Evaluating Context Usage in Context-aware Machine Translation</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/tanja_samardzic.png" style="width:100%"></a>
<div class="text1"><a href="https://www.spur.uzh.ch/en/departments/personen/tanjasamardzic.html">Language Sampling</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/stymne-karamolegkou.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-stymne-karamolegkou">Transfer Language Choice for Cross-Lingual Dependency Parsing</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/mathias-muller.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-mueller">Exploring a Sampling-based Alternative to Beam Search</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/antonis-anastasopoulos.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-anastasopoulos">Extracting Linguistic Information from Text</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/shauli-ravfogel.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-ravfogel">Linear Information Removal Methods</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/claire-bowern.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-bowern">Linguistics and Voynichese</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/sabine_weber.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-weber">Using the Distributional Inclusion Hypothesis for Unsupervised Entailment Detection</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://TBA" class="image full"><img src="lectures/speakers/huygaa_batsuren.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-batsuren">Understanding and Exploiting Language Diversity: Building Linguistic Resources from Cognate to Morphology</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://aniellodesanto.github.io/" class="image full"><img src="lectures/speakers/aniello_de_santo.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-de-santo">Mathematical Linguistics and Typological Complexity</a></div>
</div-->
<!--div class="mySlides">
<div class="numbertext"></div>
<a href="https://github.com/AleksandrsBerdicevskis" class="image full"><img src="lectures/speakers/aleksandrs_berdicevskis.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-berdicevskis"> "Typology, will you marry sociolinguistics?" asks NLP</a></div>
</div-->
<!--<div class="mySlides">
<div class="numbertext"></div>
<a href="hhttps://www.lsv.uni-saarland.de/people/badr-abdullah/" class="image full"><img src="lectures/speakers/badr_abdullah.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-abdullah"> Capturing Cross-linguistic Similarity with Speech Representation Learning</a></div>
</div>-->
<div class="mySlides">
<div class="numbertext"></div>
<a href="https://esalesky.github.io/" class="image full"><img src="lectures/speakers/elizabeth_salesky.png" style="width:100%"></a>
<div class="text1"><a href="https://sigtyp.inf.ethz.ch/channel/lecture-salesky"> Looking beyond Unicode for Open-Vocabulary Text Representations</a></div>
</div>
<!--div class="mySlides">
<div class="numbertext"> <strong><font color="red" size="12px"> PAST </font> </strong> </div>
<a href="#lecture-zamaraeva" class="image full"><img src="lectures/speakers/olga-zamaraeva.png" style="width:100%"></a>
<div class="text1"><a href="#lecture-zamaraeva">Typologically-driven Modeling of wh-Questions in a Grammar Engineering Framework</a></div>
</div-->
<!-- Next and previous buttons -->
<a class="prev1" onclick="plusSlides(-1)">❮</a>
<a class="next1" onclick="plusSlides(1)">❯</a>
</div>
<br/>
<div class="divider"></div>
<br/> <br/>
<header>
<h2 style="font-size:28px">Summer/Autumn 2021 Calendar</h2>
</header>
<br/>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-zamaraeva">[14/05/2021] Olga Zamaraeva: Typologically-driven Modeling of wh-Questions in a Grammar Engineering Framework</a>
</h4>
</div>
<div id="lecture-zamaraeva" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Olga Zamaraeva: Typologically-driven Modeling of wh-Questions in a Grammar Engineering Framework</p>
<p align="left">
Studying language typology and studying syntactic structure formally are both ways to learn about the range of variation in human languages. These two ways are often pursued separately from each other. Furthermore, assembling the complex and fragmented hypotheses about different syntactic phenomena along multiple typological dimensions becomes intractable without computational aid. In response to these issues, the Grammar Matrix grammar engineering framework combines typology and syntactic theory within a computational paradigm. As such, it offers a robust scaffolding for testing linguistic hypotheses in interaction and with respect to a clear area of applicability. In this talk, I will present my recent work on modeling the syntactic structure of constituent (wh-)questions in a typologically attested range, within the Grammar Matrix framework. The presented system of syntactic analyses is associated with grammar artifacts that can parse and generate sentences, which allowed me to rigorously test the analyses on test suites from diverse languages. The grammars can be extended directly in the future to cover more phenomena and more lexical items. Generally, the Grammar Matrix framework is intended to create implemented grammars for many languages of the world, particularly for endangered languages. In computational linguistics, formalized syntactic representations produced by such grammars play a crucial role in creating annotations which are then used for evaluating NLP system performance and which could be used for augmenting training data as well, in low-resource settings. Such grammars were also shown to be useful in applications such as grammar coaching, and advancing this line of research can contribute to educational and revitalization efforts.
The talk comprises 4 parts (one hour in total), there will be Q&A sessions after each:
1) Introduction (focusing on NLP and language variation)
2) Computational syntax with HPSG
3) Assembling typologically diverse analyses
4) Future directions of research
</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/e-qigsenkuI" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="lectures/slides/olga-zamaraeva-sigtyp.pdf" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1393418075337203718" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-zamaraeva" class="button small">RocketChat</a> <a href="https://olzama.github.io/" class="button small">Olga's Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-rawski">[04/06/2021] Jon Rawski: Typology Emerges from Computability</a>
</h4>
</div>
<div id="lecture-rawski" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Jon Rawski: Typology Emerges from Computability</p>
<p align="left">
Typology, from the ancient Sanskrit grammarians through to Alexander von Humboldt, is known to require two databases: an "encyclopedia of categories" and an "encyclopedia of types". The mathematical study of computable functions gives a rich encyclopedia of categories, and processes in natural language a rich encyclopedia of types. This talk will connect the two, especially in morphology and phonology.
Jon will:
1) overview classes of string-to-string functions (polyregular, regular, rational and subsequential);
2) use them to determine the scope and limits of linguistic processes;
3) analytically connect them to classes of transducers (and acceptors using algebraic semirings);
4) show their usefulness for Seq2Seq interpretability experiments, and implications for ML in NLP generally.
</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/LoV_62BXc0A" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="lectures/slides/jon-rawski-sigtyp.pdf" class="button small">Slides</a> <a href="TBA" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-rawski" class="button small">RocketChat</a> <a href="https://www.jrawski.info/" class="button small">Jon's Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-pimentel">[11/06/2021] Tiago Pimentel: An Informative Exploration of the Lexicon</a>
</h4>
</div>
<div id="lecture-pimentel" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Tiago Pimentel: An Informative Exploration of the Lexicon</p>
<p align="left">
During my PhD I've been exploring the lexicon through the lens of information theory. In this talk, I'll give an overview on results detailing the distribution of information in words (are initial or final positions more informative?), and cross-linguistic compensations (if a language has more information per character, are their words shorter?). I'll also present two new information-theoretic operationalisations (of systematicity and lexical ambiguity) which allow us to analyse computational linguistics question through corpus analyses -- relying only on natural (unsupervised) data.
</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/WxFfF4RKFuw" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="lectures/slides/pimentel_sigtyp2021.pdf" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1403272190242394114" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-pimentel" class="button small">RocketChat</a> <a href="https://TBA" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-ryskina">[18/06/2021] Maria Ryskina: Informal Romanization Across Languages and Scripts</a>
</h4>
</div>
<div id="lecture-ryskina" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Maria Ryskina: Informal Romanization Across Languages and Scripts</p>
<p align="left">
Informal romanization is an idiosyncratic way of typing non-Latin-script languages in Latin alphabet, commonly used in online communication. Although the character substitution choices vary between users, they are typically grounded in shared notions of visual and phonetic similarity between characters. In this talk, I will focus on the task of converting such romanized text into its native orthography and present experimental results for Russian, Arabic, and Kannada, highlighting the differences specific to writing systems. I will also show how similarity-encoding inductive bias helps in the absence of parallel data, present comparative error analysis for unsupervised finite-state and seq2seq models for this task, and explore how the combinations of the two model classes can leverage their different strengths.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/ZCgbancq4zc" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="lectures/slides/Ryskina_SIGTYP.pdf" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1406890518471081984" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-ryskina" class="button small">RocketChat</a> <a href="https://TBA" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-rijhwani">[25/06/2021] Shruti Rijhwani: Cross-Lingual Entity Linking for Low-Resource Languages</a>
</h4>
</div>
<div id="lecture-rijhwani" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Shruti Rijhwani: Cross-Lingual Entity Linking for Low-Resource Languages</p>
<p align="left">
Entity linking is the task of associating a named entity with its corresponding entry in a structured knowledge base (such as Wikipedia or Freebase). While entity linking systems for languages such as English and Spanish are well-developed, the performance of these methods on low-resource languages is significantly worse.
<br/>
In this talk, I first discuss existing methods for cross-lingual entity linking and the associated challenges of adapting them to low-resource languages. Then, I present a suite of methods developed for entity linking that do not rely on resources in the target language. The success of our proposed methods is demonstrated with experiments on multiple languages, including extremely low-resource languages such as Tigrinya, Oromo, and Lao. Additionally, this talk will show how information from entity linking can be used with state-of-the-art neural models to improve low-resource named entity recognition.
</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/3aOhygB9YOE" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="lectures/slides/rijhwani_sigtyp.pdf" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1408408601354403840" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-rijhwani" class="button small">RocketChat</a> <a href="https://TBA" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-inman">[02/07/2021] David Inman: Conceptual Interdependence in Language Description, Typology, and NLP: Examples from Nuuchahnulth</a>
</h4>
</div>
<div id="lecture-inman" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">David Inman: Conceptual Interdependence in Language Description, Typology, and NLP: Examples from Nuuchahnulth</p>
<p align="left">
The fields of language description, typology, and NLP can be and typically are pursued independently. However, approaching these from a perspective of interdependence reveals that methodologies in one can often answer or refine questions in another. Focusing on the example of coordination structures in Nuuchahnulth, a Wakashan language of British Columbia, I will walk through the connection among traditional linguistic fields and NLP, how these can inform each other, and why NLP researchers should be interested.
<br/>
Bio:
<br/>
David Inman's research is centered on Indigenous American languages, their linguistic properties, history, and typological profile. His doctoral research utilized computational tools to document properties of Nuuchahnulth, a Wakashan language spoken in Canada, and he continues investigating the challenges to syntactic theory that this language presents. At the University of Zurich, he is developing typological questionnaires targeting areal patterns in the Americas, and investigating how these overlap to produce areas of historically intense linguistic contact.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/F9mcLJaxqq8" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1410880708755660805" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-inman" class="button small">RocketChat</a> <a href="https://TBA" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-chakrabarty">[09/07/2021] Tuhin Chakrabarty: NeuroSymbolic methods for creative text generation</a>
</h4>
</div>
<div id="lecture-chakrabarty" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Tuhin Chakrabarty: NeuroSymbolic methods for creative text generation</p>
<p align="left">
Recent neural models have led to important progress in natural language generation (NLG) tasks. While pre-trained models have facilitated advances in many areas of text generation, the fields of creative language generation especially figurative language are relatively unexplored. There are important challenges that need to be addressed such as the lack of a large amount of training data as well as the inherent need for common sense and connotative knowledge required for modeling these tasks. In this talk, I will present some of my recent work on neurosymbolic methods for controllable creative text generation focusing on various types of figurative language (e.g. metaphor, simile, sarcasm). Additionally, I will discuss how we can borrow from theoretically grounded concepts of figurative language and use these inductive biases to make our generations closer to humans.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/DJK27LX1mns" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1417073606924783617" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-chakrabarty" class="button small">RocketChat</a> <a href="https://TBA" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-mielke">[16/07/2021] Sabrina Mielke: Fair Comparisons for Generative Language Models -- with a bit of Information Theory</a>
</h4>
</div>
<div id="lecture-mielke" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Sabrina Mielke: Fair Comparisons for Generative Language Models -- with a bit of Information Theory</p>
<p align="left">
How can we fairly compare the performance of generative models on multiple languages? We will see how to use probabilistic and information theory-based measures, first to evaluate (monolingual) open-vocabulary language models by total bits and then pondering the meaning of “information” and how to use it to compare machine translation models. In both cases, we get only a little glimpse at what might make languages easier or harder for models, but deviating from the polished conference talk, I will recount how I spent half a year on a super-fancy model that yielded essentially the same conclusions as a simple averaging step... The rest of the talk will be dedicated to work on actually building new open-vocabulary language models, and on evaluating and ameliorating such models' gender bias in morphologically rich languages.
<br/>
Bio: <br/>
Sabrina is a PhD student at the Johns Hopkins University and a part-time research intern at HuggingFace, researching open-vocabulary language models for segmentation and tokenization. She has published and co-organized workshops and shared tasks on these topics as well as on morphology and typological analysis in ACL, NAACL, EMNLP, LREC, and AAAI. You can find her reminisce for a time when formal language theory played a bigger role in NLP on Twitter at @sjmielke.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/ZNg-fJXYtow" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1417126589767188482" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-mielke" class="button small">RocketChat</a> <a href="https://sjmielke.com/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-futrell">[23/07/2021] Richard Futrell: Investigating Information-Theoretic Influences on the Order of Elements in Natural Language</a>
</h4>
</div>
<div id="lecture-futrell" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Richard Futrell: Investigating Information-Theoretic Influences on the Order of Elements in Natural Language</p>
<p align="left">
Why is human language the way it is? I claim that human languages can be modeled as codes that maximize information transfer subject to constraints on the process of language production and comprehension. I use this efficiency-based framework to formulate quantitative theories of the order of words, phrases, and morphemes, aiming to explain the typological universals documented by linguists as well as the statistical distribution of orders in massively cross-linguistic corpus studies. I present results about Greenbergian word order correlations, adjective order in English, and the order of verbal dependents in Hindi.
<br/>
Bio: <br/>
Richard Futrell is an Assistant Professor in the Department of Language Science at the University of California, Irvine. His research focuses on language processing in humans and machines.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/i7f89U04yUI" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1418900937045807105" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-futrell" class="button small">RocketChat</a> <a href="https://TBA" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-chodroff">[30/07/2021] Eleanor Chodroff: Structure in Cross-linguistic Phonetic Realization</a>
</h4>
</div>
<div id="lecture-chodroff" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Eleanor Chodroff: Structure in Cross-linguistic Phonetic Realization</p>
<p align="left">
A central goal of linguistic study is to understand the range and limits of cross-linguistic variation. Cross-linguistic phonetic variation is no exception to this pursuit: previous research has provided some insight into expected universal tendencies, but access to relevant and large-scale speech data has only recently become feasible. In this talk, I focus on structure in cross-linguistic phonetic variation that may reflect a universal tendency for uniformity in the phonetic realisation of a shared feature. I present case studies from cross-talker variation within a language, and then insight from cross-linguistic meta-analyses and larger-scale corpus studies.
<br/>
Bio:<br/> Eleanor Chodroff is a Lecturer in Phonetics and Phonology at the University of York. She received her PhD in Cognitive Science from Johns Hopkins University in 2017 and did a post-doc at Northwestern University in Linguistics working on speech prosody. Her research focuses on the phonetics–phonology interface, cross-talker and cross-linguistic phonetic variation, speech prosody, and speech perception.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/d7ul6YQx5rI" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1427908058072502273" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-chodroff" class="button small">RocketChat</a> <a href="https://www.eleanorchodroff.com/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-moryossef">[05/08/2021] Amit Moryossef: Including Signed Languages in NLP</a>
</h4>
</div>
<div id="lecture-moryossef" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Amit Moryossef: Including Signed Languages in NLP</p>
<p align="left">
Signed languages are the primary means of communication for many deaf and hard-of-hearing individuals. Since signed languages exhibit all the fundamental linguistic properties of natural language, I believe that tools and theories of Natural Language Processing (NLP) are crucial to its modeling. However, existing research in Sign Language Processing (SLP) seldom attempts to explore and leverage the linguistic organization of signed languages.
In this talk, I discuss the linguistic properties of signed languages, the current open questions and challenges in modeling them, and present my current research to mitigate them.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/hNvzT9_Rb1s" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1429020546751111170" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-moryossef" class="button small">RocketChat</a> <a href="https://scholar.google.co.il/citations?user=Aaj_RBEAAAAJ&hl=en" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-ataman">[13/08/2021] Duygu Ataman: Machine Translation of Morphologically-Rich Languages: a Survey and Open Challenges</a>
</h4>
</div>
<div id="lecture-ataman" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Duygu Ataman: Machine Translation of Morphologically-Rich Languages: a Survey and Open Challenges</p>
<p align="left">
Morphologically-rich languages challenge neural machine translation (NMT) models with extremely sparse vocabularies where atomic treatment of surface forms is unrealistic. This problem is typically addressed by either pre-processing words into subword units or performing translation directly at the level of characters. The former is based on word segmentation algorithms optimized using corpus-level statistics with no regard to the translation task. The latter approach has shown significant benefits for translating morphologically-rich languages, although practical applications are still limited due to increased requirements in terms of model capacity. In this talk, we present an overview of recent approaches to NMT developed for translating morphologically-rich languages and open challenges related to their future deployment.
<br/>
Bio:<br/>
Duygu Ataman holds a bachelor's and a master's degree in electrical engineering and computer science from Middle East Technical University, and KU Leuven, respectively. She completed her Ph.D. in computer science in 2019 at the University of Trento under the supervision of Marcello Federico. In her doctoral research she studied unsupervised learning of morphology, from the aspects of linguistics, cognitive science and statistics, and designed a purely statistical formulation of it within the Bayesian framework, which could be implemented in decoders of neural machine translation models in order to generate better translations in morphologically-rich languages. During her Ph.D. she was also a visiting student at the School of Informatics, University of Edinburgh advised by Dr. Alexandra Birch, and an applied scientist intern at Amazon Alexa Research. After recently completing her post-doctoral research and studies at the Institute of Computational Linguistics, University of Zürich she will soon join New York University's Courant Institute as an assistant professor and faculty fellow.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/9m8InjiWGoE" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1429743883018141701" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-ataman" class="button small">RocketChat</a> <a href="https://TBA" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-vylomova">[20/08/2021] Ekaterina Vylomova: UniMorph and Morphological Inflection Task: Past, Present, and Future</a>
</h4>
</div>
<div id="lecture-vylomova" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Ekaterina Vylomova: UniMorph and Morphological Inflection Task: Past, Present, and Future</p>
<p align="left">
In the 1960s, Hockett proposed a set of essential properties that are unique to human language such as displacement, productivity, duality of patterning, and learnability. Regardless of the language we use, these features allow us to produce new utterances and infer their meanings. Still, languages differ in the way they express meanings, or as Jacobson put it, “Languages differ essentially in what they must convey and not in what they may convey”. From a typological point of view, it is crucial to describe and understand the limits of cross-linguistic variation. In this talk, I will focus on cross-lingual annotation and regularities in inflectional morphology. More specifically, I will discuss the UniMorph project, an attempt to create a universal (cross-lingual) annotation schema, with morphosyntactic features that would occupy an intermediate position between the descriptive categories and comparative concepts. UniMorph allows an inflected word from any language to be defined by its lexical meaning, typically carried by the lemma, and a bundle of universal morphological features defined by the schema. Since 2016, the UniMorph database has been gradually developed and updated with new languages, and SIGMORPHON shared tasks served as a platform to compare computational models of inflectional morphology. During 2016–2021, the shared tasks made it possible to explore the data-driven systems’ ability to learn declension and conjugation paradigms as well as to evaluate how well they generalize across typologically diverse languages. It is especially important, since elaboration of formal techniques of cross-language generalization and prediction of universal entities across related languages should provide a new potential to the modeling of under-resourced and endangered languages. In the second part of the talk, I will outline certain challenges we faced while converting the language-specific features into UniMorph (such as case compounding). In addition, I will also discuss typical errors made by the majority of the systems, e.g. incorrectly predicted instances due to allomorphy, form variation, misspelled words, looping effects. Finally, I will provide case studies for Russian, Tibetan, and Nen.
<br/>
Bio:<br/>
Ekaterina Vylomova is a Lecturer and a Postdoctoral Fellow at the University of Melbourne. Her research is focused on compositionality modelling for morphology, models of inflectional and derivational morphology, linguistic typology, diachronic language models, and neural machine translation. She co-organized SIGTYP 2019 – 2021 workshops and shared tasks and the SIGMORPHON 2017 – 2021 shared tasks on morphological reinflection.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/q5YpLLhW-ck" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="https://www.slideshare.net/ivrik/the-unimorph-project-and-morphological-reinflection-task-past-present-and-future" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1430873751436992517" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-vylomova" class="button small">RocketChat</a> <a href="http://kat.academy" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-williams">[27/08/2021] Adina Williams: How Strongly does Grammatical Gender Correlate with the Lexical Semantics of Nouns?</a>
</h4>
</div>
<div id="lecture-williams" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Adina Williams: How Strongly does Grammatical Gender Correlate with the Lexical Semantics of Nouns?</p>
<p align="left">
Since at least Ferdinand de Saussure, linguists have aimed to understand the strength and substance of the relationship between word meaning and word form. In this talk, I present several works that explore one particular aspect of this long standing research program: grammatical gender. In particular, this presentation asks the following question: is there a statistically significant relationship between the morphological gender of a noun and its lexical meaning? I will present three recent studies that answer this question in the affirmative. These works measure the strength of the correlation between grammatical gender and several operationalizations of lexical meaning (using collocations and word embeddings). They also explore the relationship between meaning and orthographic form, uncovering related correlations for other grammatical systems (such as declension class). These works highlight how technical advancements in multilingual NLP tools and increasing availability of large text corpora can shed light on some of the most enduring questions about the nature of language.
<br/>
Bio:<br/> Adina is a Research Scientist at Facebook AI Research in NYC. Her main research goal is to strengthen connections between linguistics, cognitive science, and natural language processing. Towards that end, she brings insights about human language to bear on training, evaluating, and debiasing ML-based NLP systems, and applies tools from NLP to uncover new facts about human language.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/JRJHCeaLBEo" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1431615234452590601" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-williams" class="button small">RocketChat</a> <a href="hhttps://ai.facebook.com/people/adina-williams/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-gorman">[03/09/2021] Kyle Gorman: On "Massively Multilingual" Natural Language Processing</a>
</h4>
</div>
<div id="lecture-gorman" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Kyle Gorman: On "Massively Multilingual" Natural Language Processing</p>
<p align="left">
Early work in speech & language processing was critiqued for an overwhelming focus on English (and a few other regionally hegemonic languages). In part, this reflected resource limitations of the time. In the first half of this talk, I will discuss various ways in which speech & language processing technologies can be said to be "monolingual" or "multilingual". I will identify several distinct tendencies pushing the field towards greater multilinguality and note some tensions between these various tendencies. In the second half of the talk I will discuss some of the work out of my lab exploiting free, massively multilingual data extracted from Wiktionary, a free online dictionary. These resources include UniMorph, a collection of morphological paradigms, and WikiPron, a collection of pronunciation dictionaries. I will discuss how these data are collected and vetted, and their use in a series of recent shared tasks hosted by special interest groups of the Association for Computational Linguistics.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/0F8I1r8dP1U" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1434166009309523972" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-gorman" class="button small">RocketChat</a> <a href="http://wellformedness.com/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-mahowald">[10/09/2021] Kyle Mahowald: “Deep” Subjecthood: Classifying Grammatical Subjects and Objects across Languages</a>
</h4>
</div>
<div id="lecture-mahowald" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Kyle Mahowald: “Deep” Subjecthood: Classifying Grammatical Subjects and Objects across Languages</p>
<p align="left">
What do contextual embedding models know about grammatical subjects and objects, and how does that knowledge vary typologically? To explore that question, I will present a variety of results, probing both humans and machines using a bespoke subject/object classification task. In the first part of the talk, I will show that type-level embeddings can explain a large part of the variance in whether a given noun is a subject, but that there are cases in which contextual models play a crucial role. In the second part of the talk, I explore subject/object classification in Multilingual BERT on both transitive and intransitive sentences, across languages that vary in morphosyntactic alignment. In particular, I explore how a classifier trained on transitive subjects and objects classifies held-out intransitive subjects, comparing the model performance within and across nominative/accusative and ergative/absolutive languages. I consider the implications of these results for linguistic theories of subjecthood.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/q6hkp1zkI1I" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1438449919409328133" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-mahowald" class="button small">RocketChat</a> <a href="https://mahowak.github.io/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-yin">[17/09/2021] Kayo Yin: Understanding, Improving and Evaluating Context Usage in Context-aware Machine Translation</a>
</h4>
</div>
<div id="lecture-yin" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Kayo Yin: Understanding, Improving and Evaluating Context Usage in Context-aware Machine Translation</p>
<p align="left">
Context-aware Neural Machine Translation (NMT) models have been proposed to perform document-level translation, where certain words require information from the previous sentences to be translated accurately. However, these models are unable to use context adequately and often fail to translate relatively simple discourse phenomena. In this talk, I will discuss methods to measure context usage in NMT by using human annotations and conditional cross mutual information, as well as training methods to improve context usage by supervising attention and performing contextual word dropout. I will also discuss ways to identify words that require context to translate and how to evaluate NMT models on these ambiguous phenomena, and present open challenges in document-level translation.
<br/>
Bio: <br/>
Kayo Yin is a 2nd year Master's student at Carnegie Mellon University advised by Prof. Graham Neubig. Her research focuses on developing machine translation models that can break down communication barriers between different language users while ensuring everyone can benefit language technologies in their preferred language. She is working on identifying and resolving translation ambiguities that arise in document-level translation, as well as developing NLP models that can process signed languages. </p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/dgCjT0M7Osc" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1447463521990574080" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-yin" class="button small">RocketChat</a> <a href="https://kayoyin.github.io/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-samardzic">[24/09/2021] Tanja Samardžić: Language Sampling</a>
</h4>
</div>
<div id="lecture-samardzic" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Tanja Samardžić: Language Sampling</p>
<p align="left">
Whenever we perform an experiment to test a model or a hypothesis, we need to decide what data to include. In NLP and linguistics, this means selecting a number of languages and a number of examples (sounds, words, utterances) from each language. How do we take this decision? In theory, this decision is part of the study design: we should select a proper sample to represent our target population. In practice, however, our decisions tend to be driven by many different factors such as data availability, our familiarity with a language, ease of processing, but also factors such as political views and ethical concerns. While there is a growing awareness of biases potentially introduced by such factors, designing proper samples remains an open challenge. In this talk, I will review the most common data sampling criteria in NLP and linguistic, discuss several methods for dealing with biases (e.g. maximising diversity, controlling for phylogenetic dependence) and propose a few improvements for future work.
<br/>
Bio:<br/>
Tanja a computational linguist with a background in language theory and machine learning, currently a lecturer (Privatdozentin) and a group leader at the University of Zurich. Her research is about developing computational text processing methods and using them to test theoretical hypotheses on how language actually works. She holds a PhD in Computational linguistics from the University of Geneva, where she studied in the group Computational Learning and Computational Linguistics (CLCL). She is committed to promoting and facilitating the use of computational approaches in the study of language. </p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/zVp_eOHcuJU" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1447937107029336081" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-samardzic" class="button small">RocketChat</a> <a href="https://www.spur.uzh.ch/en/departments/personen/tanjasamardzic.html" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-karamolegkou-stymne">[01/10/2021] Antonia Karamolegkou and Sara Stymne: Transfer Language Choice for Cross-Lingual Dependency Parsing</a>
</h4>
</div>
<div id="lecture-karamolegkou-stymne" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Antonia Karamolegkou and Sara Stymne: Transfer Language Choice for Cross-Lingual Dependency Parsing</p>
<p align="left">
Lately there has been an increasing amount of work on cross-lingual learning, and how models for a target language, often with few resources, can be improved by using data from other languages. In this talk we will focus on cross-lingual dependency parsing, where the Universal Dependency treebanks serves as a great test bed, containing harmonized annotations for a diverse set of languages. Specifically, we focus on the case where we focus on specific target languages, and especially on how to choose good transfer languages. We explore the impact of a number of language similarity features on this choice, including geneological, geographic and syntactic similarity, and also the impact of different text types and training set size. We present three studies on different aspects. In our first study we focus on Latin, and compare transfer languages from the Hellenic and Italian families. In our second study we investigate transfer language choice for a more diverse set of target and transfer languages. In our third study we investigate the performance when targeting speech and Twitter data.
<br/>
Bio:<br/>
Antonia Karamolegkou is a postgraduate student in computational linguistics. She received her master's degree from Uppsala University with a thesis on argument mining. She is currently working as a software engineer in an Informatics and Technology company.
<br/>
Sara Stymne is assistant professor in computational linguistics at Uppsala University. She received her PhD from Linköping University with a thesis on machine translation. Her current main research focus is on cross-lingual methods for dependency parsing, with a special interest in the impact of the domain of training data.</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/_GIs5xsIOyo" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1453650656108040198" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-stymne-karamolegkou" class="button small">RocketChat</a> <a href="" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-mueller">[08/10/2021] Mathias Müller: Exploring a Sampling-based Alternative to Beam Search</a>
</h4>
</div>
<div id="lecture-mueller" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Mathias Müller: Exploring a Sampling-based Alternative to Beam Search</p>
<p align="left">
In this talk I will discuss the potential of Minimum Bayes Risk (MBR) decoding -- a sampling-based decoding algorithm -- to replace beam search in machine translation.
Beam search is the de-facto standard decoding algorithm for many language generation problems. However, recent work has found that beam search itself causes or exacerbates well-known biases in machine translation. Minimum Bayes Risk (MBR) decoding was suggested as an alternative algorithm that does not search for the highest-scoring translation but operates on a pool of samples.
I will highlight that MBR does not alleviate well-known biases in machine translation, but, interestingly, increases the robustness to noise in the training data and to domain shift.
<br/>
Bio:<br/>
Mathias is a post-doc and lecturer at the University of Zurich. His current main interests are 1) the meta-sciences of scientific integrity, methodology and reproducibility applied to machine translation, 2) decoding algorithms and 3) sign language translation. In his personal life he is a father of two and a passionate musician.
<br/>
Please find more information about him here: <a href="https://www.cl.uzh.ch/de/people/team/compling/mmueller.html">cl.uzh.ch/mmueller</a>
</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/sqTvsa19PtE" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="https://docs.google.com/presentation/d/11prGI8kM0N6_hDi_3b6lCuqsTGeafwuXVGZmNSUfCs0/edit?usp=sharing" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1456554346540257283" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-mueller" class="button small">RocketChat</a> <a href="https://www.cl.uzh.ch/de/people/team/compling/mmueller.html" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-anastasopoulos">[15/10/2021] Antonis Anastasopoulos: Extracting Linguistic Information from Text</a>
</h4>
</div>
<div id="lecture-anastasopoulos" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Antonis Anastasopoulos: Extracting Linguistic Information from Text</p>
<p align="left">
In this talk I'll synthesize several works from a series of papers that focus on extracting elements of a descriptive grammar of a language directly from text. I'll focus on morphosyntactic rules, as well as on identifying interesting semantic subdivisions. I'll also talk about how these "rules" can be used for the evaluation of natural language generation systems, as well as potentially aid language learners.
<br/><br/>
Bio: <br/>Antonios Anastasopoulos is an Assistant Professor in Computer Science at George Mason University. He received his PhD in Computer Science from the University of Notre Dame, advised by David Chiang and then did a postdoc at Languages Technologies Institute at Carnegie Mellon University. His research is on natural language processing with a focus on low-resource settings, endangered languages, and cross-lingual learning, and is currently funded by the National Science Foundation, the National Endowment for the Humanities, Google, Amazon, and the Virginia Research Investment Fund.
</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/Rt4kSHx1Mxs" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1457300250347991043" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-anastasopoulos" class="button small">RocketChat</a> <a href="https://cs.gmu.edu/~antonis/author/antonios-anastasopoulos/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-ravfogel">[21/10/2021] Shauli Ravfogel: Linear Information Removal Methods</a>
</h4>
</div>
<div id="lecture-ravfogel" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Shauli Ravfogel: Linear Information Removal Methods</p>
<p align="left">
I will present Iterative Nullspace Projection (INLP), a method to identify subspaces within the representation space of neural LMs that correspond to arbitrary human-interpretable concepts such as gender or syntactic function. The method is data-driven and identifies those subspaces by the training of multiple orthogonal classifiers to predict the concept at focus. I will overview some recent work of ours, which demonstrates the utility of these concept subspaces for different goals: mitigating social bias in static and contextualized embeddings and assessing the influence of concepts on the model's behavior. I will then describe an ongoing work which studies the theoretical aspects of this method, especially with regard to its optimality, and propose an alternative which uses a relaxed formulation of adversarial training.
<br/><br/>
Bio: <br/>I am starting my second year as a PhD student at Bar Ilan University (Supervised by prof. Yoav Goldberg). I am interested in representation learning, analysis and interpretability of neural models, and the syntactic abilities of NNs. Specifically, I am interested in the way neural models learn distributed representations that encode structured information, in the way they utilize those representations to solve tasks, and in our ability to control their content and map them back to interpretable concepts. During my master's I have mainly worked on the ability of NNs to acquire syntax in typologically-diverse languages, and during my PhD so far I've been working on developing tools to remove information from neural representations in a controlled manner.
</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/y-2B2y4jI8U" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="https://twitter.com/sig_typ/status/1474819620842487812" class="button small">Twitter Summary</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-ravfogel" class="button small">RocketChat</a> <a href="https://shauli-ravfogel.netlify.app/" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-bowern">[29/10/2021] Claire Bowern: Linguistics and Voynichese</a>
</h4>
</div>
<div id="lecture-bowern" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">
<p align="left" style="color:red;font-size:16px">Claire Bowern: Linguistics and Voynichese</p>
<p align="left">
In this talk I give a brief overview of work that has been done so far on figuring out whether there is linguistic "signal" underlying the text of the Voynich Manuscript (Beinecke MS 408), an early 15th Century document which has so far been impossible to read. I discuss research which aims to tell language from non-language, outline some issues with doing research on Voynichese, and discuss some possibilities for future work.
</p>
<p align="center"><a href="TBA" class="button small">Slides</a> <a href="TBA" class="button small">BiliBili</a> <a href="https://sigtyp.inf.ethz.ch/channel/lecture-bowern" class="button small">RocketChat</a> <a href="https://ling.yale.edu/people/claire-bowern" class="button small">Website</a></p>
</div>
</div>
</div>
<div class="panel panel-default">
<div class="panel-heading">
<h4 class="panel-title">
<a data-toggle="collapse" data-parent="#accordion" href="#lecture-weber">[05/11/2021] Sabine Weber: Using the Distributional Inclusion Hypothesis for Unsupervised Entailment Detection</a>
</h4>
</div>
<div id="lecture-weber" class="panel-collapse collapse">
<div class="panel-body" style="align: center; text-align:center">