An appeal for more bandwidth to the Internet Archive
nuclearcat at nuclearcat.com
Wed May 13 07:40:36 UTC 2020
What about introducing some cache offloading, like CDN doing? (Google,
Facebook, Netflix, Akamai, etc)
I think it can be rolled pretty quickly, with minimum labor efforts, at
least for heavy content.
Maybe some opensource communities can help as well, and same scheme can
be applied then to other non-profits.
But sure something more smooth like nginx caching, not bunch of
rsync/ssh scripts, as many Linux mirrors have.
On 2020-05-13 08:25, Tim Požár wrote:
> Internet Archive primary office is located at 300 Funston in San
> Francisco. This was a Christian Science church so it has the roman
> columns you would expect for a church / library. You can see it on
> Google Street Views at:
> Although they serve content out of this site, their primary site for
> bandwidth is at 2512 Florida Ave, Richmond, CA.
> IA does have satellite offices around the world for scanning, etc.,
> the public facing servers are location in these two locations.
> On 5/12/20 9:24 PM, Terrence Koeman wrote:
>> Aren't they in a former church or something? I vaguely remember their
>> location to be significant for some reason or another. So location may
>> weigh heavily.
>> -- Regards,
>> Terrence Koeman, PhD/MTh/BPsy
>> Darkness Reigns (Holding) B.V.
>> Please quote relevant replies.
>> Spelling errors courtesy of my 'smart'phone.
>> *From:* David Hubbard <dhubbard at dino.hostasaurus.com>
>> *Sent:* Wednesday, 13 May 2020 06:02
>> *To:* nanog at nanog.org
>> *Subject:* Re: An appeal for more bandwidth to the Internet Archive
>> Could the operation be moved out of California to achieve
>> dramatically reduced operating costs and perhaps solve some
>> via cost savings vs increased donation? I have to imagine with
>> storage and processing requirements that the footprint and power
>> usage in SFO is quite costly. I have equipment in a few
>> colo's and it's easily 3x what I pay for similar in Nevada, before
>> even getting into tax abatement advantages.
>> On 5/12/20, 1:33 PM, "NANOG on behalf of colin johnston"
>> <nanog-bounces at nanog.org on behalf of colinj at gt86car.org.uk>
>> Is the increased usage due to more users or more existing
>> having higher bandwidth at home to request faster ?
>> Would be interested if IPS configured firewall used to block
>> out invalid traffic/spam traffic and if such traffic increased
>> back end network capacity increased ?
>> What countries are requesting the most data and does this
>> analysis throw up questions as to why ?
>> Are there high network usage hitters which raise question as
>> why asking for so much data time and time again and is this valid
>> traffic use ?
>> > On 12 May 2020, at 17:33, Tim Požár <pozar at lns.com> wrote:
>> > Jared...
>> > Thanks for sharing this. I was the first Director of
>> Operations from '96 to '98, at was was then Internet Archive/Alex.
>> I was the network architect back then got them their ASN and
>> original address space. Folks may also know, I help start SFMIX
>> Matt Peterson.
>> > A bit more detail in this... Some of this I got from Jonah
>> Edwards who is the current Network Architect at IA. Yes, the
>> neck was the line cards. They have upgraded and that has
>> helped the bandwidth of late.
>> > Peering would be a big help for IA. At this point they have
>> two 10Gb LAG interfaces that show up on SFMIX that was turned up
>> last February. Looking at the last couple of weeks the 95th
>> percentile on this 20Gb LAG is 3 Gb. As they just turned up on
>> SFMIX, they are just starting to get peers turned up there.
>> networks that show up on SFMIX are highly encouraged to start
>> peering with them. Alas, they are v4 only at this point.
>> > Additionally, if folks do have some fat pipes that can
>> bandwidth at 200 Paul, I am sure Jonah won't turn it down.
>> > Tim
>> > On 5/12/20 4:45 AM, Jared Brown wrote:
>> >> Hello all!
>> >> Last week the Internet Archive upgraded their bandwidth
>> from 47 Gbps to 62 Gbps. It was all gobbled up immediately.
>> a lovely solid green graph showing how usage grows vertically as
>> each interface comes online until it too is 100% saturated.
>> at the graph legend you can see that their usage for the past 24
>> hours averages 49.76G on their 50G of transport.
>> >> To see the pretty pictures follow the below link:
>> >> Relevant parts from the blog post:
>> >> "A year ago, usage was 30Gbits/sec. At the beginning of
>> year, we were at 40Gbits/sec, and we were handling it. ...
>> >> Then Covid-19 hit and demand rocketed to 50Gbits/sec and
>> overran our network infrastructure’s ability to handle it. So
>> so, our network statistics probes had difficulty collecting data
>> (hence the white spots in the graphs).
>> >> We bought a second router with new line cards, and got it
>> installed and running (and none of this is easy during a
>> and increased our capacity from 47Gbits/sec peak to 62Gbits/sec
>> peak. And we are handling it better, but it is still consumed."
>> >> It is obvious that the Internet Archive needs more
>> to power the Wayback machine and to fulfill its mission of being
>> Internet library and the historic archive of our times.
>> >> The Internet Archive is present at Digital Realty SFO (200
>> Paul) and a member of the San Francisco Metropolitan Internet
>> Exchange (SFMIX).
>> >> I appeal to all list members present or capable of getting
>> to these facilities to peer with and/or donate bandwidth to the
>> Internet Archive.
>> >> I appeal to all vendors and others with equipment that
>> can donate to the Internet Archive to contact them so that they
>> scale their services and sustain their growth.
>> >> The Internet Archive is currently running 10G equipment.
>> you can help them gain 100G connectivity, 100G routing, 100G
>> switching and/or 100G DWDM capabilities, please reach out to them.
>> They have the infrastructure and dark fiber to transition to 100G,
>> but lack the equipment. You can find the Internet Archive's
>> information below or you can contact Jonah at the Archive Org
>> directly either by email or via the contact information available
>> his Twitter profile @jonahedwards.
>> >> You can also donate at https://archive.org/donate/
>> >> The Internet Archive is a 501(c)(3) non-profit. Donations
>> are tax-deductible.
>> >> Contact information:
>> >> https://archive.org/about/contact.php
>> >> Volunteering:
>> >> https://archive.org/about/volunteerpositions.php
>> >> Disclaimer: I am not affiliated with the Internet Archive.
>> Nobody asked me to write this post. If something angers you about
>> this post, be angry at me. I merely think that the Internet
>> is a good thing and deserves our support.
>> >> Jared
More information about the NANOG