From corpora-request@lists.uib.no Tue Apr 22 00:32:19 1997
Date: Mon, 21 Apr 1997 11:37:38 -0400
From: isabelle@citi.doc.ca (P. Isabelle [TAO])
Subject: Re: Hello everybody !!! (looking for parallel corpora)




> 
> >>>>> "sj" == Samuel JOLIBOIS  writes:
> 
>     sj> To do an experiment, I need a
>     sj> HUGE (200 Mo) amount of bilingual parallel corpus, in english
>     sj> and french...
> 
> 
> the canadian hansards are the classic source for this sort of
> information.  i note however, that they are not available from the LDC
> (i could have sworn they were on the ACL/DCI disk).
> 
> 
> but, researchers can look at:
> 
> http://www.parl.gc.ca/english/senate/deb-e/deb-e.htm
> 
> and
> 
> http://www.parl.gc.ca/english/senate/deb-e/deb-f.htm
> 
> 
> and thereupon draw their own conclusions about the availability of the
> data.
> 
> 

Ted is right: the electronic Hansard material used by so many
researchers has in fact never been made public in an official
manner. But some data is now available on the Web site of the Canadian
parliament:

	http://www.parl.gc.ca

Note that the page mentioned by Ted provides acces to the debates of
the Senate rather than the better known debates of the House of
Commons. The latter can be found at:

	http://www.parl.gc.ca/cgi-bin/hansard/f_hansard_master.pl

Only data from the current "legislature" (that is, since January 1994)
is available there. 

	
-- 
Pierre Isabelle                            tel: (514) 973-5801
CITI, 1575 Chomedey Blvd,                  fax: (514) 973-5757
Laval, Quebec, Canada H7V 2X2              e-mail: isabelle@citi.doc.ca