cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Benedict Elliott Smith <>
Subject Re: Partition size
Date Mon, 12 Sep 2016 11:51:08 GMT
(a link to 3rd party docs in response to a question when an equivalent link
> to project hosted docs was available)

No, it wasn't.  Or at least the link you sent was not remotely the same as
the link in the email you responded to, which was about how to understand
your partition sizes - not the configuration parameter.  Possibly you
responded to the wrong email.

Any member of a project community (contributor, committer or PMC member)

Right.  But policing *users* (which Mark most certainly is) is just
douchebaggery.  Users should feel free to participate with the resources *they
know best *without fear of reprisal.  All of your statement suggests this
shit belongs on the dev list.

Or are we really suggesting that anyone discussing things on the user list
must be 100% conversant with the "official" docs before they can make any
kind of posting to the list?  Or otherwise they can expect to be attacked
by other community members?

Talk about chilling.  I do not see this promoting engagement - who wants to
help other users out if this is what they can expect in return?  A public

Linking to third party docs, blogs, etc is fairly common but they tend to
> be linked by the OP in the form of "I've followed the instructions I found
> here and it doesn't work".

Bullshit. Try a simple google search: site:

There are 500 results.  For just one external resource.  I don't recall a
single one of these resulting in a reprimand.  Try the first three links
from the search - they do not fit *any* of your characterisations of
"normal" - but they do fit mine.

Perhaps you can link the history of projects attacking users for their
email content?

On 12 September 2016 at 12:10, Mark Thomas <> wrote:

> On 09/09/2016 21:11, Benedict Elliott Smith wrote:
> > Come on. This kind of inconsistent 'policing' is not helpful.
> How is it inconsistent? Since I subscribed to the mailing list on 22
> August, this is the first instance I have seen of anyone providing a
> link to third party docs rather than the equivalent project hosted docs
> in response to a user question. If I missed any, please point them out.
> The lists are pretty busy and that, combined with my minimal technical
> knowledge of Cassandra, means it is perfectly possible I missed some.
> I've done a quick double check of the user@ archives and while I do see
> a number of messages referencing 3rd party docs, those references were
> made by the OP rather than someone from the community providing an answer.
> > By all means, push the /*committers*/ to improve the project docs as is
> > happening, and to promote the internal resources over external ones.
> >
> > But Mark has absolutely no formal connection with the project, and his
> > contributions have only been to file a couple of JIRA (all of which have
> > so far been ignored by those of his colleagues who /are/ active
> > community members, I'll note!).  Shaming him for not linking docs that
> > describe something /other/ than what he was even talking about is
> > crossing the line IMO.
> Any member of a project community (contributor, committer or PMC member)
> directing users to 3rd party docs in preference to project docs without
> a good reason is missing an opportunity to strengthen that project
> community.
> > Linking to third-party resources is commonplace, the only difference I
> > can see here is that these have been called "docs"  by the authors,
> > instead of a blog post, and Mark has a DataStax email address.
> Linking to third party reference docs for an Apache project in response
> to a configuration question about that Apache project on one of the
> project's mailing lists is pretty unusual.
> Linking to third party docs, blogs, etc is fairly common but they tend
> to be linked by the OP in the form of "I've followed the instructions I
> found here and it doesn't work". The responses to such questions
> typically include links to the relevant parts of the Apache hosted docs.
> If the question is more involved then I have seen links to blogs,
> presentations, YouTube etc provided as an answer. If this happens
> multiple times for the same topic then it is usually added to an FAQ,
> wiki or similar along with an e-mail to the author to see if they'd be
> willing to contribute something to the docs.
> > Would you have reacted this way if Aaron Morton linked a blog post by
> > thelastpickle?  Or a random user posted their own resources?  Obviously
> not.
> Wrong. My reaction was based on the content of the message (a link to
> 3rd party docs in response to a question when an equivalent link to
> project hosted docs was available) not on who sent it or their employer.
> > I was initially all for the ASF endeavour to counteract DataStax'
> > outsized influence on the project, and was hopeful you might achieve
> > some positive change.  Perhaps you may well still do.  But it seems to
> > me that the ASF behaviour is beginning to cross from constructive
> > criticism of the project participants to prejudicially hostile behaviour
> > against certain community members - and that is unlikely to result in a
> > better project.
> >
> > You should be treating everyone consistently, in a manner that promotes
> > project health.
> It is not healthy if community members are directing users to 3rd party
> documentation in preference to the project's own documentation. If it is
> happening because the project's documentation is non-existent / wrong /
> poorly written / etc. then that is understandable (and would be an issue
> the project needed to address) but that was not the case in this instance.
> There are many aspects to community health. In the grand scheme of
> things the single e-mail that started this particular discussion is in
> the noise. However, a consistent pattern of such e-mails would be much
> more troubling. My intent was to ensure that such a pattern did not form.
> Whether people agree with my response or not, the community is hopefully
> more aware of the issue than it was previously.
> Mark
> > On Friday, 9 September 2016, Mark Thomas <
> > <>> wrote:
> >
> >     On 09/09/2016 16:46, Mark Curtis wrote:
> >     > If your partition sizes are over 100MB iirc then you'll normally
> see
> >     > warnings in your system.log, this will outline the partition key,
> at
> >     > least in Cassandra 2.0 and 2.1 as I recall.
> >     >
> >     > Your best friend here is nodetool cfstats which shows you the
> >     > min/mean/max partition sizes for your table. It's quite often used
> to
> >     > pinpoint large partitons on nodes in a cluster.
> >     >
> >     > More info
> >     > here:
> >
> toolsCFstats.html
> >     <
> toolsCFstats.html>
> >
> >     Folks,
> >
> >     It is *Apache* Cassandra. If you are going to point to docs, please
> >     point to the official Apache docs unless there is a very good reason
> >     not to.
> >
> >     In this case:
> >
> >
> cassandra_config_file.html#compaction_large_partition_warning_threshold_mb
> >     <
> cassandra_config_file.html#compaction_large_partition_warning_threshold_mb
> >
> >
> >     looks to the place.
> >
> >     Mark
> >
> >
> >     >
> >     > Thanks
> >     >
> >     > Mark
> >     >
> >     >
> >     > On 9 September 2016 at 02:53, Anshu Vajpayee <
> >     > <>> wrote:
> >     >
> >     >     Is there any way to get partition size for a  partition key ?
> >     >
> >     >
> >

View raw message